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DESCRIPTION 

METHODS OF PREPARING NUCLEIC ACIDS FOR 
MASS SPECT ROMETRIC ANALYSIS - - 
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CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation of U.S. Application Serial No. 60/032,369 filed 
December 2. 1996 and U.S. Application Serial No. 08/759,993 Hied December 2, 1996 which is 
a continuation-in-part of U.S. Application Serial No. 08/715,582 filed September 19. 1996. 

INTRODUCTION 

Approximately 4,000 human disorders are attributed to. genetic, causes. Hundreds of 
genes responsible for various disorders have been mapped, and sequence information is being 
accumulated n^)idly. A principal goal of the Human Genome Project is to find all genes 
associated with each disorder. The definitive diagnostic test for any specific genetic disease (or 
predisposition to disease) will be the identification of polymorphic variations in the DNA 
sequence of affected cells that result in alterations of gene function. Furthermore, response to 
specific medications may depend on the presence of polymorphisms. Developing DNA (or 
RNA) screening as a practical tool for medical diagnostics requires a method that is inexpensive, 
accurate, expeditious, and robust. 

Genetic polymorphisms and mutations can manifest themselves in several fonns, such as 
point polymorphisms or point mutations where a single base is changed to one of the three other 
bases, deletions where one or more bases arc removed from a nucleic acid sequence and the 
bases flankii^ the deleted sequence are directly linked to each other, insertions when new bases 
are inserted at a particular point in a nucleic acid sequence adding additional length to the overall 
sequence, and expansions and reductions of repeating sequence motifs. Large insertions and 
deletions, often the result of chromosomal receMnbinaUon and rearrangement events, can lead to 



WOf8/12355 ^„ 

2 PCT/US97/17101 

_ partial or coniplete loss Of a gene. Ofthesefon„sofpolymorphis™,i„ge„enUthemostdiff.a^^^ 
type of change to screen for and detect is the point polymorphism because it represents the 
smallest degree of moJecuJar change. 

Although a number of genetic defects can be linked to a specific single point mnMion 
^ithm a gene, sickle cell anemia, many are caused by a wide spectrum of different mutations 
thmughout the gene. A typical gene that might be screened could be anywhere fiom 1 000 to 
100,000basesinlen8th,thoughsmallerandlargerge„esdoexist. Of that amount of DNA only 
a fract.on-of the base pairs actually encode the protein. These discontinuous protein elding 
regions are called exons and the remainder of the gerie is referred to as introns. Of th«e 
types of regions, exons often contain the most important sequem.es to be screened Several 
complex procedu.^ have been developed for scannihg genes in order to detect polymorphisms 
These procedures are applicable to both exons and introns. 

In tem« of cunent use, most of the methods to scan or screen genes employ slab or 
capillary gel electrophoresis for the sepamUon ami detection step in the assays Gel 
eJectK,phoresis of nucleic acids primarily provides relative size :nformation based on mobility 
through the gel matrix. If calibration standards ^ employed, gel electrophoresis can be used to 
measur. absolute and relative moleeular weights of large biomoiecules with some moderate 
degree of accumcy; even then typically the accuracy is only 5% to 10%. Also the molecular 
weight resolution is limited. In cases where two DNA fragments with the identical number of 
base pam. can be separated, using high concentration polyacr^lamide gels, it is still not possible 
to .dentify which band on a gel corresponds to which DNA fragment without performing 
secondary labeling experiments Thus, gel electrophoresis techniques can only determine size 
and cam.ot provide any infom^ation about changes in ba«. composition or sequ^ce without 
perfonnmg more complex sequencing reacUons: Gel-based techniques, for the most pan are 
dependent on labeling or staining methods to visualize and discriminate between diffexen. 
nucleic acid fragments. 

All of dte methods in use today capable of screening broadly for geneUc polymorphisms 
suffer from technical complication and are labor and time intensive. Single strand 
conformational polymorphism (SSCP) (Orita « a/.. 1989), det^turing gradient gel 
clectmphoresis (DGGE) (Abrams e/ ,990). chemical cleavage ai mismatch (CCM) (Saleeba 
and Cotton. 1993). enzymatic mismatch cleavage (EMC) (Youil e/ al., 1995). and "cleavasc" 
augment length polymorphism (CFLP) procedures are currently gel-based, making them 
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cumbersome to automate and petfonn efficioitly. There is a need for new methods that can 
provide cost effective and expeditious means for screening genetic material in an effort to reduce 
medical expenses. 

The late 1 980*8 saw the rise of two new mass spectrometric techniques for successfiJIy 
measuring the masses of intact very large biomolecules. namely, matrix-assisted hiser 
desorption/ionization (MALDI) time-of-flight mass spectrometry (TOF MS) (Tanaka etaL, 
1988; Spengler e/a/., 1989) and electrospray ionization (ESI) combined with a variety of mass 
Malms (Fenn et al., 1 989). Both of these methods are suitable for genetic screening tests. The 
MALDI mass spectrometric technique can also be used with methods other than time-of-flight, 
for example, magnetic sector, Fourier-transform ion cyclotron resonance, quadrupolc; and 
quadrupole trap. One of the advances in MALDI analysis of polynucleotides was the discovery 
of 3-hydroxypicolimc acid ("3-HPA") as a matrix for mixed-base oligonucleotides (Wu. eial.. 
1993). 

MALDI-TOF MS involves laser pulses focused on a small sampje plate comprising 
analyte molecules (/.e. nucleic acids) embedded in either a solid or Uquid matrix which is 
tj^ically a small, highly absorbing m^erial. The laser pulses traiisfer energy to, the matrix 
calling a microscopic ablation and concomitant ionization of the , analyte molecules, producing a 
gaseous pliime of intact, charged nucleic acids in single-stranded form. If double-stranded 
nucleic acids are analyzed, the MALDI-TOF MS typically results in mostly denatured single- 
strand detection. The ions generated by the laser pulses are accelerated to a fixed kinetic energy 
by a strong electric field and then pass through an electric field-free region in vacuum, traveling 
with a velocity corresponding to their respective mass-to-charge ratios (m/z). Thus, the smaller 
m/z ions will travel through the vacuum region faster than the larger m/z ions thereby causing a 
separation. At the end of the electric field-free region, the ions collide with a detector that 
generates a signal as each set of ions of a particular mass-to-charge ratio strikes the detector. 
Usually for a given assay, 10 to 100 mass spectra resulting from individual laser pulses are 
summed together to make a single composite mass spectrum with an improved signal-to-noise 
ratio. 

The mass of an ion (such as a charged nucleic acid) is measured by using its velocity to 
determine the mass-to-charge ratio by time-of-flight analysis. In other words, the mass of the 
molecule directiy correlates with the time it takes to travel from the sample plate to the detector. 
The entire process takes only microseconds. In an automated apparatus, tens to hundreds of 
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samples can be analyzed per minute. In addition to speed. MALDI-TOF MS has one of the 
largest ttiass ranges for mass spcctromctric devices. The current mass range for MALDI-TOF 
MS is from 1 to 1.000,000 Daltons (Da) (measured recently for a protein) (Nelson « < 1995). 

The perfbnhance of a mass spectrometer is measured by its sensitivity, mass resolution 
and mass accuracy. Sensitivity is measured by the amount of material needed; it is generally 
desirable and possible with mass spectrometry to work with sample amounts in the femtomole 
and low picomble range. Mass resolution. m/Am, is the measure of an instnm«nfs ability to 
produce l^jparate signals fiom ions of similar iftass. Mass resolution is defined tts the inass. m, of 
an ion signal divided the foil width of the signal. Am, usually measured between points of 
half-maximum iritcrisity. Mass accuracy is the measure oferror in designating a mass to an ion 
sigual. The mass acbiiracy is deiihWi as the raiio of the mass assignment error divided by the 
liiass of the ion a:nd can be represented as a percentage. 

To be able to detect any poim polymorphism directly by MALDI-TOF mass 
spectrometry, one Would need to resolve and accurately measure the masses of nucleic acids in 
which a single bas<i change has occurred (in comparison to the wild type nucleic acid). A single 
ibase change cah be i mass difference of as little as 9 Da This value represents the difference 
between the two b^s with the closest mass values. A and T (A = 2'-deoxyadenosine-5'- 
phosphate = S13.19 Da; T= 2'-deoxythymidinc-5'-phosphate = 304.20 Da; G =2'- 
deoXyguanosine-5-phosphate = 329.21 Da: and C = 2'-deoxycytidine-5'-phosphate = 289.19 Da). 
If during the mutation process, a single A changes to T or a single T to A, the mutant nucleic 
acid containing the base transvcKion will either decrease or increase by 9 Da in total mass as 
compared to the wild type nucleic acid. For mass spectrometry to directly detect these 
u^sversions, it must therefore be able to detect a minimum mass change. Am, of approximately 
■9Da. ■ ■. •-•■'*■ 

For example, in order to fiilly resolve (which may not b^ necessary) a point-mutated 
(Ato T or T to A) Jjeterozygote 50-base single^stranded DNA fragmem having a mass, m, of 
^ 15,000 Da from its corresponding wild type nucleic acid, the required mass resolution is m/Am 
= 15.000/9 « 1,700. However, the mass accuracy needs to be significantly better than 9 Da to 
increase quality assurance and to prevent ambiguities where the measured mass value is near the 
half-way poim between the two theoretical masses. For an analyte of 15.000 Da. in practice the 
mass accuracy m»ds to be Am ~ ±3 Da = 6 Da. In this case, the absolute mass accuracy required 
is (6/l5:000)'100 = 0.04%. Often a distinguishing level of mass accuracy relative to another 
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kiiown peak in the specmun is sufficient to resolve ambiguities. For example, if there is a known 
mass peak 1000 Da ftpm the mass peak in question, the relative position of the unknown to the 
known peak may be knovyn with greater accuracy than that provided by an absolute, previous 
calibration of the mass spectrometer. 

In order for mass spectromeUry to be a useful tool for screening for polymorphisms in 
nucleic acids, several basic requirements should be met First, any nucleic acids to be analyzed 
should be purified to minimia: the presence of salt ions and other molecular contaminants. 
These-impurities may reduce the intensity and quality of die mass spectrometric signal to a point 
where either (i) the signal is undetectable or unreliable, or (ii),the mass accuracy and/or 
resolution is below the value necessary to detect the t^ of polymorphism expected. Second, 
the size of the nucleic acids to be analyzed should be within the range where then is sufficient 
mass resolution and accuracy. Mass accuracy and resolution significanUy degrade as the mass of 
the analyte increases. CurrenUy, the detection of single nucleotide polymorphisms (SNPs) above 
said mass value is difficult aboye a mass of approximately 30.000 Da for.oligonucleotides (~ 100 
bases). Third/because all molecules within a sample are visualized during mass spectrometric 
analysis (i.c, it is not possible to selectively label and visualize certain molecules and not others 
as one can with gel electrophoresis methods), nucl^c acid samples should be partitioned prior to 
analysis to remove unwanted nucleic acid products from the spectrum^ Fourth, the mass 
spectroraeuic methods for generalized nucleic ?«:id screening must be efficient and cost effective 
in order to screen a large number of nucleic acid bases in as few steps as possible. 

The methods for detecting nucleic acid polymorphisms known in the art do not satisfy 
these four requirements. For example, current methods for mass spectrometric analysis of DNA 
fragments have focused on double-stranded DNA fragments which result in complicated mass 
specua. making it difficult to resolve mass differences between two complemeiitary strands (see, 
e.g.. Tang et aL, 1994). Thus, there is a need for cost and time effective methods of detecting 
genetic polymorphisms using mass spectrometry, preferably MALDI or ESI, and with mass 
areuracy of a few parts in 10,000 or better. 

SUMMARY OF THE INVENTION 

1 his invention provides novel methods and kits for the screening of target nucleic acids 
and the identification of changes in base composition that might resuh from a genetic 
polymorphism. The present invention discloses novel processes focusing on the use of mass 
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spectrometiy as 6 genetic analysis tooJ and employing the unique properties of mass 
spectrometiy and MALDI-TOF MS, in particuiar, to separate different amplified single-stranded 
taiget niKleic acids and identify their mass exactly. SignificanUy. mass spectrometiy requires 
only minute samples, provides extremely detailed information about the molecules being 
analyzed including hi jghmas^ accuracy, and is easily autoniaied. 

The presem invemion encompasses several embodiments, such as (1) procedures for 
reducing the length of target nucleic acids by removing one or more flanking polynucleotide 
regionis that "flank," or are adjacent to or near, the regions of interest; (2) procedures for isolating 
either single-stianded or double-stranded target nucleic acids for inass spectrometric analysis; (3) 
procedures cbmBining these two aspects; and (4) kits for the methods described herein. 

The present invention encompasses several embodiinents, such as (1) piocedures for 
preparing a double-stianded taiBCl nucleic acid for mass spectrometric analysis; (2) prbadures 
for determining the mass of target nucleic acids, where the target nucleic acid may be either 
single^stranded or double-stranded; and (3) kits for preparing a double-stranded target nucleic 
acid for mass spectiomedic analysis; It w^II be understood by th(^e of skill in the art that where 
the nucleic acid is double-strandcd, the two strands are complementary to each other and are 
connected via hydrogen bonds along the sti^inds. 

An embodiment of the presem invention encompasses a method of determining the mass 
of a target nucleic acid by mass spectromietric analysis. This method generally includes: 
identifying a target nucleic acid; reducing the length of the target nucleic acid by cleaving at least 
a portion of one or ihore of the flanking regions to produce a reduced-length target nucleic acid; 
obtaining a single-stranded reduced-lengtK target nucleic acid; and determining the mass of the 
single-stranded reduced-length target nucleic acid using a mass spectrometer. Typically, the 
target nucleic acid will contain a region of interest and one or more flanking regiohs. 

A preferred embodiment encompasses amplifying the target nucleic acid prior to reducing 
the length of the target nucleic acid to produce an amplified target nuclac acid. The amplified 
target nucleic acid may be subsequently reduced in length and obtained in single-stranded form, 
free of its complement, for mass spectral analysis. The target nucleic acid may be ampUfied by 
any method known by one of skill in the art. for example, polymerase chain reaction ("PCR™", 
with PCR™ being a preferred amplification method. These methods are well known by those of 
skill in the art. 
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It is contemplated that one of skill in the art may use the methods of this invention to 
analyze more than one taiget nucleic acid simultaneously. As u^ hetein "a" wHI be 
understood to mean one or more. Thus, "a target nucleic acid" may refer, for example, to one, 
two. three, four, five or more target nucleic acids. Aspects of this invention,, dieiefore. include 
dcterniining .the mass of one single-stranded reduced-length target nucleic acid as well as 
detetmining the masses of multiple single-stranded reduced-length tai^et nucleic acids 
simultaneously or in seriatim. Where the masses of multiple single-stranded reduced-length 
target nucleic acids are being determined, each of the target nucleic acids may be reduced in 
length by the same or a different method. Similarly, the single-sstranded reduced-length target 
nucleic acids may be obtained from the reduced-lengUi target nucleic acids by the same or 
different methods. For example, if two target nucleic acids arc identified, or selected, for 
analysis, then these two target nucleic acids may both be reduced in length by an endonuclease. 
or one may be reduced in length by an endonuclease and the other by cleaving at a chemically 
cieavable site, and so on. 

The target nucleic acids encompassed by this invention will generally contain a region of 
interest and one or more flanking regions. A "region of intcsrest" refers to the region for which 
one is interested in determining the mass. For example, when the methods disclosed in this 
invention are employed to detect or screen for polymorphisms, the region of interest would be 
the region containing, or that is suspected of containing, a polymorphism. The flanking regions 
are the portions of DNA sequence on either side of the region of interest. 

For embodiments employing PCRtm primers and polymerases to amplify a target nucleic 
acid, the primer is often complementary to a portion of one or more flanking regions of the target 
nucleic acid to allow the primer to effectively anneal to the target nucleic acid and provide a site 
to extend a complement to the target nucleic acid via PCR™. Therefore, for the methods 
comprising amplification, it is preferred that at least one of the primers is complementary to a 
portion of a flanking region that is preferably adjacent to or close to the polynucleotide region of 
interest, generally within 40 nucleotides. 

When the methods of this invention are used to detect a polymorphism; the target niicleic 
acids employed in this invention may include any polynucleotide sequence that contains or is 
suspected of containing a polymorphism, including but not limited to short tandem repeats 
(STRs), simple sequence length polymorphisms (SSLP), single nucleotide polymorphisms 
(SNPs). and any of a multitude of disease markers, for example, markers for sickle cell anemia, 
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fragile X disorder, cystic Hbi^sis, Tay Sachs disease. Gaucher disease, thalassemias, and cancer- 
leJated genes. WhUe the target nucleic acids for use in cbnjuncuon vrfth the present invention 
may be douWe- or single-stranded. it is preferable that the nucleic acids be obtained in single- 
sinmdedfohn, free of its complementary stran^^ These single-stranded 

target nucleic acids nuQ. be any size that can be adequately resolved by mass spectiometric 
analysis. Preferably, in cases where a SNP is to be detected, the final product single-stranded 
amplified target nucleic acids are less than about 100 bases in length. More preferably, the final 
producL-single-stranded amplified target nucleic acids ai« fiom about 10 to 90 bases L length 
As used in this context, "about" means anywhere frorn i 1 to lO W pairs, and all the integers in 
between, for example, ±1 . ±2, ±3. ±4. ±5, ±6, ±7; ±8, ±9. or 

However, one of ordinary skill in the art will apineciate that as mass spectrometric 
techniques for analysis of nucleic acids improve, the sizes of single-stranded amplified target 
nucleic acids useful in this invention can be increased. The nanire of the mutaUon to be detected 
is also a factor in the size limitations for optimum mass resoluUon. For example, as described 
above for SNPs. the maximum siz«; limit iriay be approximately I ob nucleotides in length. 
However, for microsatellite repeats and bther two ri»k:lebtide repeats, the maximum size limit 
may be approximately 200 nucleotides in Jength, and the maximum size limit for four-nucleotide 
repeats may be approximately 300 nucleotides. 

The target nUcleic acids of this invention may be either double-stranded or single- 
Sttanded. As used herein, the phrase "obtaining a single-stranded reduced-length target nucleic 
acid" refers to isolating a single-stranded nucleic acid free from its complement for purposes of 
mass spectral analysis. Where the target nucleic acid is single-stranded, it will be understood by 
those of skill in the art that no further steps are nx,uired to obtain the single-stranded leduced- 
length target nucleic acid from the reduced- length target riucleic acid. However^ where the target 
nucleic acid is double-stranded, one of the two complementary strands must be sep«ated or 
isolated from the other such that only one of the two strands, is subjected to mass spectrometty, 
e.g., by binding one of the strands to a solid support denaturing the double-sirandcd nucleic acid 
and isolating either the bound or unbound suand free from its complement. This allows for 
greater mass resolution, simplifies the spectrum, and eliminates the collection of cumulative 
information. 
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■ Thetenncomplementaiyreferetothefoniiationofsuffidcnthyd^ 
two nucleic acids to stabUize a double-stranded nucleotide sequence formed by hybridization of 
the two nucleic acids. 

The methods for reducing the length of target nucleic acids eliminate unnecessary 
sequences and reduce the mass of the resulting single-stranded or double-stranded target nucleic 
acids, resulting in increased mass resolution and accuracy. 

Exemplary methods of reducing length include: cleaving at endogenous restriction 
endonuclease cleavable sites present in one or more flanking regions but absent in the region of 
interest: cleaving at restriction endonuclease cleavable sites which arc at or adjacent to restriction 
endonuclease recognition sites incorporated into one or more of the flanking regions where the 
cleavable sites are introduced imo the flanking regions using of one or more cleavable primers 
containing restriction endonuclease recognition sites within their sequences; cleaving at a 
combination of restriction endonuclease cleavable sites where the jsites are endogenous and/or 
introduced using mismatch or overhanging primers: selective digestion of one or more flanking 
regions using exonucieasc and an exonuclease blocking moiety to pit)tect the regions of interest 
from digestion; and chemically cleaving at a chemically cleavable site. For embodiments where 
cleavable sites are employed, the cleavable sites are often located in or near a flanking; region. 
However, the lai^et nucleic acids may be reduced in length by any of the methods known by 
those of skill in the art for cleaving within one or more flanking regions preferably without 
cleaving within the region of interest. 

Another aspect of the invention involves the use of cleavable primers to reduce the length 
of an amplified target nucleic acid. An amplified target nucleic acid may be reduced in length by 
cleaving at least a portion of one or more of the flanking regions having a cleavable site. In this 
context, the cleavable site may be introduced via a cleavable primer and may be located outside 
of the region of interest. Cleavable primers of the invention may include those having an 
exonuclease blocking moiety, a Type IIS restriction endonuclease recognition site, a Type II 
restriction endonuclease recognition site, and sites capable of being chemically cleaved. 

The restriction endonucleases employed with the present invention may include type II 
and type IIS restriction endoniicleases. The restriction endonuclease recognition sites may be 
either vwthin a primer region, or outside the primer region, so long as the restriction 
endonuclease cleavable sites are within or near one or more of the flanking regions. The 
restriction endonuclease recognition sites are preferably not within a region of interest. For type 
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II restriction endomicleases, the lestriction ebdonucleasc recognition site is the same as the 
restriction endonuclease cleavable site. For Type IIS restriction endonucleases, the cleavable site 
is at a defined distance away from one side of die recognition site, usually fix)m about i 4 to about 
20 base pairs aM«y. Thus/if the Type hS recogmtion site is co«ed^^^^^ 
the endonuclcase cleaving site must be within about 20 bases of that flanking region and is' 
preferably within 14 about bases of that flanking region. Thus, the term W as employed in 
this aspect of tiie invention means "within about 20 bases." 

. .Another embodiment of the invention involves reducing the lengUi of an amplified target 
nucleic acid and isolating a sirigle-stranded amplified tai^et nucleic acid at die same time by 
using a cleavable primer having an exonuclease blocking moiety. After amplification of die 
target nucleic acid, d,e amplified target iiucl«c acid will include an «co„uclease blocking 
moiety. The amplified target nucleic acid is dten treatd with a 5' to 3' exonuclease, which 
degrades the strand containing die exohuclease blocking moiety in a 5' to 3' direction only up to 
the blocking moiety. The 5' to 3' exonuclease iiiay optionally degrade die odier complementaor 
strand^ of d« amplified target nucleic acid, in cases wh^ the i»d,er strand does not have an 
exonuclease blocking moiety, m treatitterii witi, ti,c > to 3' exonucl««e leaves a reduced- 
Icngdi,<single-strmded amplified target hucleic aci^^ 

Cleavabte sites widiin cleavable priiners may include chemically cleavable groups 
incorporated widiin the phosphate backbone linkage {e.g. replacement of phosphate with a 
phosphoramidate) or as a substituent on or replacement of one of the bases or sugars of die 
oligonucleotide primfer (e.g: a modified base or sugar, for example, a more labile glycosidic 
linkage). Siicb chemically cleavable groups would be apparent to One of skill in the an in light 
of die present disclosure and include, for example, diijkoxy^lane. 3HS)-phosphon>thioate, 5'- 
(S)-phosphorodiioate. 3'-(N>.j*osphoroamidate. 5'-(N)-phosphoroamidate. and ribose. FIGS. 
16A and 16B depict a 3 -(S)-phosphorothioate arid 5'-(S)-phosph6roti.ioate- respectively as 
defined in this invention. Note diat diese linkages are often ref^d to as diibphosphates as well. 
A similar nomenclature is employed for 34N).phosph6roaniidate. 5'^).phosphbroamldate. 
The chemically cleavable site should genfeially be staWe under die amplification, hybridization 
and washing conditions to be eiiiployed and is preferably within one or more of die flanking 
regions. 

tea preferred embodiment, the cleavable site is located near the 3' end of die primer used 
to bind die amplified target nucleic acid to the solid support By locating die cleavable site near 



the 3' end, it is possible, to further reduce the length of the amplified target nucleic acid, 
eliminating a flanking region from the polynucleotide region of interest. Cleavable primers are 
described in PCT/US96/061 16. filed April 26. 1996 (incorporated herein by reference). 

Accordingly, cleavable primers may contain one or more restriction recognition sites of 
one or more dififerent restriction endonucleases; one or more cleavable sites of one or more 
different restriction endonucleases; one or more exonuclease blocking moieties; one or more sites 
cqwble of chemical cleavage; or a combination thereof 

Jhe present invention also provides methods for obtaining single-stranded or double- 
stianded amplified target nucleic acids. The isolation methods include direct attachment . of oiie 
of the two strands of a double-stranded amplified target nucleic acid or a set of such molecules, 
to a solid support . The isolation methods further include indiicct attachment of a singlc-stranded 
or double-stranded amplified target nucleic acid, or a set thereof" to a solid support via an 
attachment capable of attaching to a solid support via covalent or noncovalent attachment. 
Methods of direct attachment include for example, biotin/avidin interactions, as well as other 
inethods known by those of skill in the art. 

For example, in one embodiment, a strand of an amplified target nucleic acid may be 
bound or attached to a solid support to permit rigorous washirig and concomitant removal of salt 
adducts, unwanted oligonucleotides and enzymes. Either a double-stranded amplified tai^et 
nucleic acid or a single-siranded amplified target nucleic acid may be isolated for mass 
spectronietric fihalysii. The singlcstrandcd amplifled target nucleic acid analyzed by MS may 
be either the sttand bound or not bound to the solid support. 

When the unbound strand is used for; MS analysis, it is typically purified by first washing 
the bound strand and its attached complement under condiUons not sufficiently rigorous to 
disrupt the strand's attathment to its bound complement. After unwanted biomoleculcs and salts 
are rwnoved. the complement may then be released under more rigorous conditions (see FIG 
11). • • ■ ■ • 

In contrast when the bound strand is to be analyzed, it is typically washed under more 
vigorous conditions such that the interactions between the bound strand and its unbound 
complement is disrupted. This allows the unbound strand to be washed away with the other salts 
and unwanted biomoleculcs. Cleavable linkers or cleavable primers may be used to release the 
bound strand from the solid support prior to MS analysis. 
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»c.«i^^3-Mroxypic.,™c «ia (-J-HPA-,. ^ acid, picWc ..id. 
ilihydroxybenzoic acid, and uilrophenol. ' 

-ft. r«,uci„g «, ob«i„i»8 3«ps may occur cc^tc^, :^ any ortcr „ 
«.n,.a.«.^y. ™-.-^i»v.*do.c„con.p.«e3(i,™,.ci.,«»^ 

*c MWcn^ ^ n«,cic acid f,« iion, i.s ^ ^ ^ ^ 

(u, »„«,>..n.ous,y -edu^in, .engO, of u„ ^ „^^^^^ 
..s ccpte^n^, ^ ^ .„ ^ ^ 

^,rT^°" » - a ^ingie-s^cd .cduced-icn,^ ^ 

»«clc«.c.d...,fc^^.„^„^,^^„^^_^_,^^^^^__^^^_^^^^^^ --S 

.A««l« aap.c. Of d^ invenUon cncon,pa..s a maftod of d.«™i„i„g d» n»ss of a 
■"^ ""Ctocacid, where d« nucleic acid gene.,,, c„„«i„3 a firs, s«„d and a a=.«rf 
^n^P^n.^ su*.,. ,.e n^od of .Ke i„v=„,i„„ i„c,ud=s: ide„,i,yi„g a ™^eic acid; 

*e ,arge. nucleic acd prior .o Cueing tf. lengd, of d» .arge, ™„,eic acid ,„ 
^ «, amplified .a^e, nucleic acidi reducing ,hc lengU, of «» ..rge, „^ic acid by 
cle.v,„g a. leas, a ponion of one or more 6fa« flanking ^ , 

arge. nucleic acid. oNaining a single^ ^ ^ 

de^mnnrng d,e inass of .he single-s.ra»d«i reduced-length «.ge. nucleic acid u», a ^ 
spcc^mcrer v*e,e ,he ,arge. nucleic add firter comprises a region of imces, and one or mo,. 
fl»h«S .gions ««1 *e ol„.i,iing s,ep comprises ««lung U» firs, s.rand of ^ 

^Phficd «,ge, mKleic acid u> a soBd su^ ^ .he firs. s«u„ from ,he second 

^ U, poduc. * b.«d firs, s«and and «, unbound second s,„nd. In d,is cmbodimen, d,c 
mMs of Ihe unbound second SWBUI is deiennined. ■ 

IK prcsen. invenuon addiUonally encompasses primers and m«hods for using primers 
« capabte Of being .■a-rached- or bound u. a solid suppori. Oenerally, d,is is accomplish 
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by attaching a binding group or moiety to the primer or to a modified nucleotide during 
amplification, where the binding group or moiety is capable of attaching or binding the 
oligonucleotide to the solid support. This binding moiety may be attached to_the oligonucleotide 
primer or amplification product either directly, through an, intervening linking group or by 
specific hybridization to an intermediary oligonucleotide which is itself bound to a solid support. 
Binding moieties include functional groups for covalent bonding to a solid support, ligands that 
atbch to the solid support via a high-affinity, noncoyalent imeiaction (such as biotin with 
streptayidin). a series of bases complementary to an intermediary oligonucleotide which is itself 
attached to the solid support, as well as other means that.arc , well-known to those of skill in the 
art, such as those described in PCT WO 96/37630, incorporated herein by tefeience. 

The first strand is typically separated fypm die second strand by washing under conditions 
rigorous enough to disrupt the double-stranded basp, pairing stnicu^e. but not rigorous enough to 
disrupt the attachment of the boutid Grst strand to the solid, support. The solution-phase (or 
washings) containing d»e unbound strand can then be prepare 

and 

However, the cleavable site should preferably not be between the binding moiety, i.e. the group 
attaching the first bound strand to the solid support, and the region of interest. Alternati vely, the 
cleavable site should be incorporated into the second strand only, and not into the fim strand that 
is to be attached to the solid support. 

A preferred embodiment encompasses the use of a cleavable primer having a chemically 
cleavable group of 3'-(S)-phosphorothioate or 5'-(S)-phosphorolhioate, where the first strand Is 
biotinylatbd and bound to a solid support via a biotin.avidin interaction (i.e. where streptavidin 
beads are used for a solid support). It is also preferable to employ mass-modified nucleotides 
with this aspect of the invention. 

Alternatively, the obtaining step may include (a) attaching the first strand of the ampl^ 
target nucleic acid to a solid support, (b) separating the first strand from the second strand to 
produce a bound fi«t strand and an unbound second stnmd. .(c) removing the unbound second 
strand, and (d) releasing the bound first stiand fix)m the solid support to produce a single- 
stranded reduced-length ampfified target nucleic acid for mass spectral analysis. In Utis 
embodiment, the mass of the bound first strand is determined using a mass spectrometer. 

Several methods may be employed to release the reduced-length single-stranded 
amplified target nucleic acid from the solid support. Generally, tiie methods used must eiUier 
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employ rcversiWe chemical intenurtions between the binding group and the solid support, that is, 
a "cleavable linker/ or a Separate chemically or enzymaiically cleavable site somewhere within' 
the botmd product. Thus, these methods for releasing the bound strand include all of the 
methods that may be used for reducing the length of the bound strand as welL For example, an 
exonuclease blocking group, endonuclease recognition site, or a chemically cleavable site may be 
incorporated into the bound stiand between the binding moiety and the region of interest, 
cleaving at one of these sites through use of an exonuclease, endonuclease, or a chemical agent 
accompirshes both the releasmg and the reduction In length simultaneously. When inore than 
one target nucleic acid is identified for analysis, the target nucleic acids may be released and 
analyzed at the same time or consecutively. 

This invention also encompasses methods for release that do not include reducing the 
length of the amplified or uriamplified target nucleic acids depending on the method used to bind 
the amplified target nucleic acid to the solid support. For example, both the hybridization and 
biotin/streptavidin methods employ hieans such as denaturation to disrupt the noncovalent 
interactions and cause the release of the bound single-stranded target nucleic acids^ It may be 
prefeired to use a chemically cleavable site with the biotin/streptavidin method so that release of 
the taiBct nuddc adds can be pdfomied under relatively mi^ 

Another embodiment of this invention encompasses a method of preparing a double- 
stranded target nucleic acid for mass spectrometric analysis. This method generally includes 
comprising: amplifying a target nucleic acid to produce an amplified tai^et nucleic acid; 
attaching the first strand of the amplified target nucleic acid to a solid support to produce a bound 
first strand and an unbound second stiand; removing, or detaching, the unbound second strand 
from the bound first strand; releasing the bound first strand from the solid support to fonn a 
single-stranded amplified target nucleic acid; and determining the mass of the single-stranded 
amplified target nucleic acid using a mass spectrometer where the amplified target nucleic acid 
comprises a fust strand and a second complementary strand. In this embodiment, the unbound 
second stiand is typically removed from the bound first stiand by denaturing and washing. 

A prefenred embodiment encompasses employing a cleavable linker during the releasing 
step, wherein the determining step preferably does not involve sequencing the amplified target 
nucleic acid. 

The present inventioh also provides methods of detecting polymorphisms in one or more 
target nucleic acids. This embodfanent generally includes: amplifying at least one target nucleic 
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acid; reducing the length of at least one of the amplified target nucleic acids comprising cleaving 
bffa portion of one or more flanking regions, and determining the masses of each of the reduced- 
length amplified target nucleic acids using a mass spectrometer wherein said amplified target 
nucleic acid comprises a region of interest and one or more flanking regions. This method may 
be used to detect polymorphisms in a single target nucleic acid compared to a wild type target 
nucleic acid by detecting variability in mass. Other "alleles" of the target nucleic acid may also 
be detected using the methods of the invention. 

. ^ -In the present disclosure^ "wild type" is the standard or reference nucleotide sequence to 
which variations are compared. Thus, by definition, any variation from wild type is considered a 
polymorphism, including naturally occurring sequence variations and pathogenic mutations. 

In another embodiment, methods are provided for detecting polymorphisms in at least 
one target nucleic acid. These methods may include: amplifying at least one target nucleic acid; 
isolating either a positive or negative sUMd of the amplified target nucleic acid to form a single- 
stranded amplified target nucleic acid; and determining the masses of each single-stranded 
amplified target nucleic acid using a mass spectrometer where the amplified target nucleic acid 
comprises a region of interest and one or more flanking regions. 

In yet another embodiment^ methods are provided for detecting polymorphisms in at least 
one target nucleic acid by amplifying at least one target nucleic acid; reducing the length of at 
least one of the amplified target nucleic acids comprising cleaving off a portion of one or more 
flanking regions; isolating either a positive or negative strand of said amplified target nucleic 
acid to form an amplified target nucleic acid; and determining the mass of each single-stranded 
amplified target nucleic acid using a mass spectrometer where the amplified target nucleic acid 
comprises a region of interest and optionally one or more flanking regions 

The methods described in the present invention may also be used to detect 
polymorphisms in a set of different target nucleic acids, in this context, the methods should 
generally include: amplifying each of the target nucleic acids; reducing the length and/or 
isolating a single-strand of each of said amplified target nucleic acids; and determining the mass 
of each of the single-strands of said amplified target nucleic acids using mass spectrometry. 
Thus, these methods can be used to detect polymorphisms in a plurality of difTerent target 
nucleic acids simultaneously. 

Using the methods described herein, one can uniquely identify a genomic sample by 
amplifying the target nucleic acids: isolating single-stranded amplified target nucleic acids: and 
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■fc-imWng of *c si„glc»,„d.d ™pH««, ,3,^ .c«s „^ 

whcl, be „«d » indicate a disease «e. or p^siiy u, 6is^ ^ ^ 
sourceofihesaii,ple,ormaplocaiioiisina8«uimi. ' 

m a«,d»e.b«iin,e,«. m«hod,.„ prov^ 
o„ a^pLfled .arge. „„cWc acid a. one fl^ 

pol^de^, irp^em, ^ « leas. on. of *e a™p.ifi«. ,a^ ™.e.eio acids befo„ 
the isolating Step. 

•"•*«««™'»«-em.".ea»>*r.rd«^^ 
.»teng Step comprises toding U,c ..,pHr«, ^ „^.;, ^ ,„ , ^ _^ 

remo™,, step comprises u«„g one or mo« Wieim e™ionuc.e«cs » cleave off one or m«e 
flanking polynucleotide regions. 

The mass of a prefeaUy single-si^Kled amplified large, nucleic acid may be comp«d 
«.* .h. k„o«nor predicted mass of .he co„e^„di„g „i„ amplified urge. 

"».e.c a^d. dia. is. Ue „iM «,pe version of u„ ,^e, nucleic .id U« is ^ng screoJL 
polymorpWso. Alte^atfvely. dte masses of mo„ d,a„ one amplified «r^ nndeic acid ^ 

comp««l ^U, .b. »nm™ or predicted masses of Ute co™»p„,dmg wild .yp. ampUfied .arge. 

nucleic acids. ' ■ ^ 

The amplified Urge, nucleic acid or se. fcereof. can opU.™lly have one or mom 
nuc ccdes ^placed «viU, mass-modified nucleoiides. including mas^modified nucleofide 
analog. For example, no. 2A and F,G. 2B illuslmte d« i^^eas. in .esoluUon for a A u, T 
m.,a,ion where .he mass-modified nudeodde l«p,y„yld«,xyuridine fca, been us«. in place of T 
dunng PCR amplif,ca.io„. f^e use of d,is mass-modifled nucleoUde reaute in a sepan„io. of 
sp«« peaks Of «5 mass ^ U^ ^^, mass uni«. As iis example illusuates 
mass-modified nucleoUdes of dte ,„se„. invemion ma, eflec. subs.a«ia, increases in specrai 
-esoluuo. ^ only „lad.ely small modiflc.«o„s in ...Ss! Oiher exanples of mass-modified 
»ucl««Kl« .».iul in *. p««„ iava,fion indude S^J-aminoallylW-dUTP. S-bromo-dCTP 5- 
".do-dCTP. 7.med,yM0TP. 7-*.-.dOTT. drt,.. S-bromo-dUTP. l.N6.d«„o-dATP 5- 
mercun-dCTP. aninonted,ylcoumarin-6-dUTP. bi„.i„-16-dUTP. 5-me<hy|.dCTP T-deaza- 
dATP. alphadnoHiNTPs, n6-aminohexyl-dATP. 5-iodo-dUTP. 
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Another optional aspect of the invention is the inclusion of internal calibrants or internal 
seJf-calibrants in the amplified target nucleic acid or set thereof to be analyzed by mass 
spectrometry to provide improved mass accuracy. 

A preferred aspect of die invention includes the methods of detecting polymorphisms 
where the determining step further includes uUiizing internal self-calibrants to provide improved 
mass accuracy; The isolaUon methods separately or together may also be combined with the use 
of internal self-calibrants. 

- . - The above methods, separately or in combination, may also be combined with the use of 
mass-modified nucleotides and mass-modified nucleotide analogs incorporated , in the stagle- 
stranded or double-stranded amplified target nucleic acid or set of single-stranded or double- 
stranded amplified target nucleic acids to improve mass resolution between mass peaks. The 
methods of detecting polymorphisms may also include at least one single-stianded amplified 
target nucleic acid optionally having one or mote nucleotides repl^ with , mass-modified 
nucleotides. 

In another embodimem, kits for preparing amplified target nucleic acids for mass 
spectromeuic analysis are provided. The kits of the invention may include a first primer capable 
of binding a first strand bf one of the target nucleic acids at a region 5' to a region ofinterest of 
said target nucleic acid; a second primer capable of binding a second strand complementary to 
the first strand at a region 5' to the region of interest of the target nucleic acid; a DNA 
polymerase capable of extending the primers to form primer extension, products of the first and 
second primers; and a restriction endonuciease capable of reducing the length of amplified target 
nucleic acids where the first and second primers and said DNA polymerase aie provided in a 
concentration and buffer suitable for increasing the number of target nucleic, acids to form 
amplified target nucleic acids 

Another embodiment encompasses a kit for preparing a douWe-stranded target nucleic 
acid having a first strand and a second complementao' strand for mass spectt«meiric analysis 
including: a first primer capable of binding the first strand of the target nucleic acid 5' to a region 
ofinterest of the target nucleic acid; a second primer capable of binding the second strand of the 
target nucleic acid 5" to the region of interest of the target nucleic acid; a DNA polymerase 
capable of extending the primers to form an amplified target nucleic acid; and a restriction 
endonuciease capable of reducing the length of the amplified target nucleic acid. 
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The first and second primers and DNA polymerase may be provid d in a concentration 
and buffer suitable for increasing the number of target nucleic acids to forai amplified target 
nucleic acids. The restriction endonucleases may be Type II or Type IIS restriction 
endonucleases. Preferably, the first primer is biotinylated. preferably at or near the 5' end and the 
kit further comprises a solid support capable of selectively binding the first irtrand of the 
amplified target nucleic acid. Thus, where the firat primer is biotinylated, the solid support could 
be a streptavidin bead. Kits included in this invention may preferably also comprise a matrix, 
such 3*Kyd^xy|}Icoiinie add. - 

An aspect of the present invention also includes a kit for prci»ring a double^stranded 
target nucleic acid having a first Stra^ 

spectrometric analysns cbihprising: a first primer capable of binding the first suand of the target 
nucleic acid 5? to a region of interest of the target nucleic acid; a second primer capable of 
binding the second strand 5* to the region of interest of the target nucleic acid; anda DNA 
polymerase capable of extending the primers to form an amplified target nucleic acid, where the 
'first prirtier comprises k deavable primer cleavable by chemical or enzymatic treatment. 
Preferred cleavable primers iriclude those having an exohuclease blocking moiety, a Type II or 
Type n restriction endonudease recognition site, or a chemically cleavable site, such as a, 
modified base, a modified sugar, or a chemically cleavable git,up incorporated into the phosphate 
backbone Preferred chemically cleavable groups are dialkoxysilane. 3'-{S).phosphorothioate. 
5XS)-phosphor6thioate, 3'^(N).phosphoroamidate, or 5'-(N).phosphonMmidate.. 

Preferably, the kit may also contain a solid suppori capableof selectively binding the first 
strand of the amplified target nucleic acid. For example, if the first strand preferably comprises a 
biotin, the solid support could comprise a streptavidin bead. These kits may also pieferably 
further comprise a matrix, such as 3-hydroxypicoUnic acid. 

AiioUKT embodinient is a kit conuiinihg: a first primer capable of binding a first strand of 
one of the target nucleic adds at a region 5' to a region of interest of the target nucleic acid; a 
second priniercapsAle of bindiiigasecond straiid complemCT^^ to the first strand at a region 5" 
to the regionofintdest of the target nucleic add; a DNA polymerase capable of extending the 
primers to form primer extension products of the first and second primers, where at least one of 
die two primers is i cleavable primer. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. lA is a resolved spectnim of nudeic acid fiagments (DNA) in the 20,000 to 25,000 

Da range using M ALDI-TOF mass spectrometiy. This positive ion time of flight mass sjrectrum 

was obtained from 200 ihioles of DNA in 3-HPA the summation of 100 laser pulses at 266 nm. 

The spectnim is of a single-stranded 72-mer which also shows a 71-mer. The FWHM resoluUon 

is 240 clearly resolving matrix adducts (labeled M). 

FIG. IB displays the positive ion TOF mass spectnmi of a 88-mer parent peak and has a 

resolution of 330. This MALDI-TOF spectrum is a sum of 100 laser pulses at 266 nm was 

obtained from 200 fmolesof DNA in 3-HPA. 

FIG. 2A shows the mass spectrum of a heterozygous mix of wild type and mutant DNA 
fragments where an A has mutated to a T giving spectral peaks separated by 9 mass units. 

FIG.2B illustrates the effect on mass resolution of a mass-substituted base. The 
spectrum in FIG. 2B consists of a mass spectrum of a hetero^gpiis mix of wild type and mutant 
DNA fragments where A has mutated to T and the T has been replaced by heptynyldeoxyuridine 
during amplification of the mutant region (R = heptyhye). The spectral peaks are now separated 
by 65 mass units as compared to only 9 mass units in FIG. 2A. 

FIG. 3 is a diagram illustrating the effect of analyzing full-length double-stranded 
amplified target nucleic acid, where the blunt-ended douWe-strands result in unresolved peaks in 
the mass spectrum of the unresolved double-stranded fragments. In this instance, the source 
nucleic acid inay be amplified, for example, by PGR. and then mass analyzed as the full-length, 
double-stranded product. The amplified target nucleic acid should typically be no greater than 
about 100 base pairs in length. 

FIG. 4 is a diagram illustrating the effect of analyzing reduced-length double-stiandcd 
amplified target nucleic acid, where one of the strands has a 4 nucleotide overhang which results 
in fully resolved peaks of the double-stranded fragments in die mass spectrum. In this case, die 
source nucleic acid is amplified (e.j?. PGR) in step (a), then reduced in length by restriction 
digestion of the double-stranded product in step (b) to yield an uneven ended product. The 
reduced-length, double-stranded product is then mass analyzisd to yield to fiilly resolved peaks. 

FIG. 5 illustrates that analyzing only a single-stranded amplified target nucleic acid 
reduces the number of strands and simplifies the mass spectrum. The source nucleic acid is 
amplified (e.g. PGR) in step (a) and then captured to a solid phase streptavidin bead in step (b). 
The solid phase is then rigorously washed to remove salts and unwanted biomolecules including 
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the bottom complementaiy strand in step (c): The isolated full-length, single-stranded amplified 
target strand is finally released into solution to be mass analyzed in step (d). Although this 
diagram as well as the diagrams in FIG. 6, FIG. 7. FIG. 8, RG. 9, FIG. 10, FIG. ] j and FIG. 12 
depicts the use of a biotinylate top primer (B) anda streptavi din solid support, these schemes ate 
also generally applicable to other methods of selecUvely binding one strand of an amplified 
target nucleic acid to a solid support to fecilitate the washing away of unwanted biomolecules, 
salts and the other cohiplementary strains. 

- nC. 6 employs a cleavable primer to reduce iri teiigth an amplified target hucieic acid 
that is larger than 100 base pairs to less than 100 nucleotides in length. The cleaving at the 
cleavable primer site also releases the single-stranded amplified target nucleic acid from the solid 
support. In this instance, the top primbr is biotinylated and cleavable. In step (a) the source 
nucleic acid is amplified (e.g. PCR) and captured to solid phase streptavidin bead in step (b). 
The solid phase is then rigorously washed tb remove salts and unwanted biomolecules including 
the, bottom complementary strand in step (c). The isblatcd. reduced-length, single-stranded 
amplified:target is released into solution by cleaving the prirher at the cleavable site in step (d) 
for mass analysis. 

FIG. 7 is ia diagrain illustrating the isolation of a singie-stranded amplified target nucleic 
acid that has been reduced in length by cleaving off at least a portion of both Hanking regions. 
The first flanking region contains a cleavable site in the cleavable primer, located outside of the 
region of interest The second flanking region is on the opposite end of the amplified target 
nucleic acid arid the portion of that second flanking region is cleaved off by digestion with a 
restriction endonuclease. The top primer is biotinylated and cleavable. The source nucleic acid 
is first amplified {e.g. PGR) (step (a)) and captured to the solid phase (.^ g streptavidin bead) 
(step (b)). The double-stranded target is then selectively restricted outside the genetic region of 
interest (step (c)). The order of steps (b) and (c) may be reversed. The solid phase is rigorously 
washed to remove salts and unwanted biomolecules including the bottom complementary strand 
(step (d)). Finally, the isolated, reduced-length. single-stranded amplified taiget strand is 
released into solution to be mass analyzed by cleaving the cleavable primer (step (e)). 

FIG. 8 shows the isolation of a single-stranded amplified target nucleic acid, where the 
length of the ampUfied taiget nucleic acid is reduced by cleaving off a portion of both flanking 
regions by ( I) using a first or top primer having a chemically cleavable site incorporated during 
amplification; and (2) using a bottom primer having a Type IIS restriction endonuclease 
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recognition site at the 5' end during ampiificaUpn. The target nucleic add is fiist amplified {e.g., 
PGR) (step (a)). The use of the bottom primer with a type US icsariction enzyme recognition site 
on the 5' end allows for the incorporation of this site on the end of the target nucleic acid. The 
top primer is then captured to a solid phase {e.g., a streptavidin bead) in step (b). One flanking 
region is thra cleaved offby digesting with a Type IIS restriction endonucieas^ After 
rigorously washing to remove salts and unwanted biomolecules, including the unwanted 
(uiibound) complementary strand in step (d), the single-stranded amplified target nucleic acid is 
released from the solid support by cleaving at the cleavable site within die cleavable primer in 
step (e) for mass spectral analysis. The order of steps (b) and (c) arc rcversible. 

^J^^"^ ^o^tT einbodiment of the invention, wherein, one primer has ah 
exbnuclease blocker (.). After amplification of the target pucleic acid, step (a); the amplified 
target nucleic acid contains an cxonuclease blocking group. The amplified target nucleic acid is 
th^n treated with a 5' to 3' cxonuclease, step (b% which degrades the strand ^cohtaimng the 
exonucicase blocking group only up to the bloQking group. The 5' to 3* exonucl«»se completely 
degrades the other complementary strand of the amplified target nucleic acid as the other stiand 
does not have an cxonuclease blocking groiip. The treatment with the 5' to 3* cxonuclease; thus, 
leaves a single stranded amplified targe^ nucleic acid for mass speptroraetric analysis/ 

FIG. 10 is a diagram illusiranng yet another einbodiment. in which one primer contains a 
Type lis restriction recognition site and a binding moiety, e.g., biotin (B). wherein the Type IIS 
restriction cleavage site is located between the Type IIS restriction recognition site and the 
binding moiety. The source nucleic acid is first arnplified using this primer and another primer 
complementary" to tiw other strand, step (a). The amplified target nucleic acid is then restricted 
with' the Type IIS restriction endonuclease corresponding to the Type IIS restriction recognition 
and cibavable sites in the primer (step (b)), leaving a reduced-length arnplified target nucleic acid 
comprising a binding moiety, e.g. biotin. which can then be captured to a solid phase 
(streptavidin bead) (step (c)). The reduced-length amplified target nucleic acid is theh rigorously 
washed to remove salts and the unbound complementary strand, step (d). Then the reduced- 
leiigih. single-stranded amplified target nucleic acid is released fiom the solid support for mass 
spectromeuic analysis by deiiaturing the biotin ^treptayidin bond. e.g.. by boiling under low salt 
conditions, step (e). 

FIG. li is a diagram illustrating a variation of the embodiment illustrated in FIG. 10. 
wherein instead of isolating the bound reduced length, single-stranded amplified target nucleic 
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ac.d. tbe complementary (unbotuK.) strand is released from the bound strand and isolated for 
IIU.3S spectrometric analysis. 11^ Tleamplified 
taiBetnucleicadd is then cleaved with a type IIS restriction endonuclease (step-(b)) andcaptmed 
to a sohd phase (..g.. by biotln (b) interacting with a streptavidin bead) (step (c)) Unvvamed 

salts and biomolecules are removed by washing (step (d)). This time! in contrast to the schemes 
depictionFIG.5.FIG.6,FIG.7,FIG.a^^ 

d.srupt the interaction between the bound strand and its complement In step (e) the 
complementary strand is released for mass analysis. 

FIG. 12 is another embodiment Wherein the double stranded DNA is.mass analyzed The 
, sourcenucleicaddisamplified(step(a)): ,„this«,bo^^^^^^ 

contains a type IIS restri.t,o« site ^ueh 'that th. cleavage site is between the region of interest and 
the btotin moiety. The^mpiified target nucleic acid is captured to the solid phase (step (b)) to 
facihtate washing away unwanted salts and biomolecules (step (c)). A ^striciion endonuclease 
.s then used taxelease the double stranded ffegmcm for mass spectral analysis (step (d))^ 

FIG. 13 is a mass spectrum of single-stranded amplified short tandem repeats from the 
tyrosirie hydroxylase geiie'THOllocus. 

FIG. .14.is a mass spectrum of an allelic set (ladder) of single-stranded amplified target 
nucleic acids from th. THOl gene locus, where.n .he single-stranded amplified target nucleic 
acds ranged in length from 71 to 95 nucleotides in length. The method used to p^duce this 
spectrum is the one depicted in FIG. 6. 

FIG. 15 is a mass spectrum of a set of single-stranded amplified target nucleic acids 
wherem the single-stranded amplified target nucleic acids were the same as those depicted in 
FIG. 14 except that the lengths of the amplified target nucleic acids had been reduced by 31 base 
pairs by endonuclease cleavage. The method employed to prduce this spectrum is the one 
illustrated in FIG. 7. 

FIG. I6A shows die chemical fommla for 2'-deoxythymidine-3-.(S)-phosphorothioate 
FIG. 16B shows the chemical formula for 2'-deoxythymidine-5 -(S)-phosphorothioate. , 

DESCRIPTION OF SPECIFIC EMBODIMENTS 
The present invemion, di,^cted to methods of and fkits for preparing target nucleic acids 
for mass spectrometric analysis and for detecting polymorphisms, provides advantages of 
.echmcal ease, speed, and high sensitivity. Additionally, only minute samples of femtomole 
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amounts are required. The methods and kits described herein yield a mihihial set of products 
with improved mass resolution and accuracy and detailed information about the nature of the 
polymorphisms detected in the target nucleic acids screoied. 

One embodiment of the present invention involves methods of detecting polymorphisms 
in one or more target nucleic acids comprising (a) amplifying at least one of said lai^et nucleic 
acids, wherein each of said target nucleic acids comprises a region of interest and optionally one 
or more flanking regions, (b) isolating either a positive or a negative strand of interest of each of 
jaid,target nucleic acids in the form of one or more single-stranded amplified target nucleic 
acids, wherein said isolating preferably comprises binding said strand of inteicst of each of said 
amplified target nucleic acids to a solid support, and (c) determining the masses of each of said 
single-stranded amplified target nucleic acids using a rhass spectrometer, wherein said 
determining preferably does not involve sequencing of said amplified singte^stiandcd target 
nucleic ac>ds. The amplifying step may include the use of a specialized primer that can be used 
in the isolating step to bind the amplified target nucleic acids to a solid support: The primer may 
also have attached a cleavable or reversible linker, or the primer itself may contain a cleavable 
site. If a cleavable site is introduced into one of the amplified target nudeic acids by using a 
cleavable or revensible linker during said amplifying step, the determining does not involve 
sequencing of the amplified target nucleic acids. The primer may; also be biotinylated or 
modified in other ways such as to effect binding of the amplified target nudeic acids to a solid 
support. The primer may al.so optionally be bound or attached to the solid support prior to being 
amplified. One of ordinary skill in the art will appreciate the multiplicity of methods to effect 
such attachment 

The isolating may further comprise denaturirig and washing to remove the 
complementary strand from the strand of interest which is bound to a solid support, followed by 
release of the bound single-stranded target nucleic acids from the solid support. Alternatively, 
the unbound complementary strand may be released and isolated for mass spectrometric analysis. 

Afier amplifying and either before or after the amplified target nucleic acids have been 
bound to a solid support, the amplified target nucleic acid may be reduced in length by a number 
of different techniques. For ex^ple. one or more fianking regions may be cleaved using one or 
more restriction endonucleases. such as Type 11 or Type IIS restriction ehdonucleases or 
combinations thereof. The amplified target nucldc acid may also be reduced in length by using a 
cleavable primer. Another mediod of reducing length comprises using an exonuclease blocking 
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moiety in one of the two primers for ampIificaUon, and digesting said inpUficd target nucleic 
acid with a 5' to 3' exonudease. 

The target nucleic acid may be single^steanded or doubic-stranded DNA, RNA or hybrids 
thereof, fiom any sbuice.- The target nucleic acid is generally a nucleic acid which must be 
screened to determine whether it contains a polymorphism The cobesponding target nucleic 
acid derived from a wild type source is referred to as a Wild type target nucleic add. TTie 
amplified target nudeicacids can be obtained from a ^urce sample containing mrcldc adds and 
can be,H«duced fh>m the nudeic acid by pgr™ amplification or 

Although human sources are preferred, any source which bne is interested in screening for 
polymorphisms may be used in the methods described herdn. When the target nudci^^ is 
RNA, the RNA strand is the -f sUartd. If desired, the ta^t nucleic abid may be an RNA/bNA 
hybrid, wherein either strand can be designated the + strand and the other, the - strand. 

in cases where the amplified target nuddc acid conuiins RNA. the methods using 
restriction endonudeases described herein cannot bfe used to directly deduce the length of the 
final product. A restricUon endonuclease may be used to reduce the lengA of the double- 
stranded DN A intermediates prior to the RNA trarisdiptioii^^s^^^^ 

The.amplified target liudcic adds are typically less than 100 bases in length because 
current mass spectromeUic methods do not have the rhass accuracy and resolution necessary to 
identify a single base change in polynucleotides huBer than 100 base pairs. However, as mass 
spectrometnc techniques for analyzing nucleic acids improve, the single-stranded or double- 
stranded amplified target nucldc acid.s of this invention may be larger than 100 bases in length. 

Due to the simpler mass spectrum that results from mass analysis of single-stranded 
amplified target nucleic acids, it is preferred to determine the masses of sets of singie-stranded 
amplified target nucleic acids. The am^fied target nucleic acids may also contain mass- 
modified nucleotides, which can ediance ease of aridysis, es^^cially when a poim 
polymorphism has resulted in a very small mass change (on the order of 9 Da) In a target nucleic 
acid as compared to the corresponding wild type target nucleic acid. The methods described 
herein use mass spectrometry to detemrine the masses of a single-stranded amplified target 
nucleic acid or set of single^tranded amplified target nucleic acids to detect polymorphisms in at 
least one target nucldc add. : 

The amplified target nuddc acids comprise a region of interest and optionally, one or 
more flanking regions. A region Of inters contains or is suspected of containing a 
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polymorphism, whereas a flanking region is generally believed not to contain a polyhwiphism or 
a polymoiphism in that region is considered unimportant. The region of interest niay be as small 
as a single nucleotide. A flanking region may contoin a cleavable site or cleavable moiety that 
can be selectively cleaved to release single-stranded nucleic acids from a solid support prior to 
mass spectromelric analysis. An amplified target nucleic acid may also optionally comprise 
ariother flanking region on the end of the target nucleic acid opposite from the cleavable site used 
for release from the solid support. This second flanking region may contain one or more 
restriction cleavable sites that do not occur in the region of interest. 

The mediods described herein may be performed on a single amplified taiget nucleic acid 
or on a set of different amplified target nucleic acids, each containing a different region of 
interest. The various steps of reducing length, binding to a solid support, releasing from the solid 
support may differ with respect to each different target nucleic acid in a set, or may be the same, 
and the resulting set of single-stranded or double-straiided amplified target nucleic acids can be 
mass analyzed simultaneously. Accordingly, another advantage of the methods described herein 
is that they can be used to prepare a set or collection of two or more different target nucleic acids 
in a single reaction or a single container, possibly using at least one commoh reagent, which 
results in increased efficiency and tnore informative data frt>m,a single mass spectrum of the 
prepared target nucleic acids. 

Wild type refers to a standard or reference nucleotide sequence, or number of repeat di-, 
tri-. or tetra-nucleotides, to which variations are compared. As defined, any variation from wild 
type is considered a polymorphism, including naturally occurring sequence polymorphisms, and 
mutations which are pathogenic. 

Two nucleic acids are considered "coniplementary" if they are capable of specifically 
hybridizing to one another (i) under typical hybridization and wash conditions (see, eg., Maniatis 
ei aL 1982) or (ii) using reduced stringency wash conditions that allow at most abbut 25-30% 
base pair mismatches, for example. 2 x SSC. 0.1% SDS, room temperature twice. 30 minutes 
each: then 2 x SSC. 0.1% SDS. 37°C orjce. 30 minutes; then 2 x SSC room temperature twice, 10 
minutes each. 

The types of mass spectrometry used in the invention include ESI or MALDI. wherein 
these methods may optionally include time-of-flighl. The significant multiple charging of 
molecules in ESI and the fact that complex mixture analysis is often required mean djat the ESI 
mass spectra will consist of a great many spectral peaks, possibly overlapping and causing 
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analysis of the invention: wcmass 
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Benefits OF Analyzing Single-Stranded Nucleic Aods 

One object of this invention is the accurate mass determination of a single-stranded 
amplified target nucleic acid or a set of singlc-stnmded amplified target nucleic acids to 
detennine presence and character of any polymorphisms. The embodiments of diis invention 
include mass spectrometric mass detennination of the single-stranded amplified target nucleic 
acid or set of different single-stranded amplified target nucleic acids, as well as mass 
determination of a mass-modified, single-stranded amplified target nucleic acid or set thereof A 
prefetred embodiment is to detect polymorphisms in an amplified target nucleic acid in single- 
stranded form, wherein the single-stranded amplified target nucleic acid(s) are derived fiom one 
of either the positive or the negative strand of the genome. The examples of single-stranded 
methods described herein focus on single-stranded amplified target., nucleic acids derived fiom 
the positive strand, although the methods disclosed in the present invention are equally 
applicable to target nucleic acid as derived from the negative strand as.well. 

FIG. 3 illustrates that a double-stranded target nucleic acid comprising two 
complementary strands, produces two difficult to resolve peaks In the mais spectrum 
corresponding to the denatured single strands. The additional peaks from doublc-stianded 
amplified target nucleic acids as compared to single-stranded amplified target nucleic acids add 
to the congestion of mass peaks in the mass spectra, as well as introducing the possibility that it 
may be extremely difficult lo resolve the complementary fragments if they have nearly or exactly 
identical base compositions. Furthermore, some portion of the double-stranded amplified target 
nucleic acids do not fully denature, and mass peaks corresponding to the double-stranded 
products increase the ispectral congestion. 

Spectra using both strands may also contain a two-fold redundancy in data, since any 
polymorphism in one strand may be present within its complement. Oiie strand my be removed 
prior lo mass spectrometric analysis while still producing all data necessary for a complete 
polymorphism analysis. Therefore, it is often preferable to analyze a set of single strands, using 
only one of the two complemenUry sets of amplified target nucleic acids representing the full set 
of target nucleic acids. 

FIG. 4 shows the expated spectrum if only the positive strand of a target nucleic acid 
from FIG. 3 is analyzed by mass spectrometry as analysis of one of the two complementary 
strands of the double-stranded amplified target nucleic acids halves the number of expected 
peaks within the mass spectra, allowing for better resolution. Thus, removal of one of the two 



W098/12355 

28 PCT/US97/I7I0I 
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Approaches to Isolating Single-Stranded or Double-Stranded Ampufied Tarcct 
Nucleic Aans 

As described earlier, analysis of singlc-siranded amplified target nucleic acids is 
generally preferable since it provides a complete set of data with the minimal number of 
fragments and therefore simplifies the spectra and facUitates an increase in the total number of 
target nucleic acids that can be analyzed in a smgle assay. A number of approaches can be taken 
lowd the pnxluction of single-stranded amplified target nucleic acids and their purification 
wWcb.. .includes the elimination of undesired oligonucleotides. In some cases, it may be 
preferable to use a method of amplificauon that yields primarily single-stranded amplified tat^et 
nucleic acids, such as asymmetric PCR or transcription-mediated amplification. 

To isolate the singie-sttanded amplified target nucleic acids, the amplified tiirgct nucleic 
acids may be designed to be aiiached or bound to a solid support. Several means are available to 
effect this attachment to a solid support, including: (a) hybridization to a complementaiy, solid- 
phase bound nucleic acid capture probe (which can be an oligonucleotide or one stnmd of die 
amplified target nucleic acid) comprising a fust binding moiety that spedfically binds to a 
serond binding moiety attached to a solid phase; (b) direct binding of the amplified target nucleic 
acid strands of interest, each comprising a polynucleotide region of interest, and a first binding 
moiety, to a second binding moiety atuiched to a solid phase (e.g. biotin/streptavidin or avidin or 
antigen/antibody pairs); or (c) direct covalent aUachmcnt of the strands of interest to a solid 
support. 

A capture probe is an oligonucleotide that comprises a portion capable of hybridizing to a 
nucleic acid, such as an amplified target nucleic acid, and a binding moiety that binds the capture 
probe to a solid phase, either through covalent binding or affinity binding, or a mixture thereof. 
A capture probe can itself bind to a solid support via binding moieties (direct capture) or can 
bind to a solid support via another capture probe that binds to a solid support (indirect capture). 

A preferred embodiment is the use of a biotinylated amplified target nucleic acid coupled 
to streptavidin or avidin attached to a solid support where the strand of interest is itself bound. 
Biotin coupling to streptavidin (or avidin) requires that any amplified target nucleic acid or acids 
contain a biotin. The biotin is pan of the linker molecule. It is straightforwaid to capture the 
amplified target nucleic acid because biotinylated primers dui be used in the PCR amplification. 
If only one of the two strands of an amplified target nucleic acid is to be analyzed by mass 
spectrometry, only one of the two PCR primers for each different target nucleic acid should be 
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btetMated: For each target nucleic acid, the PCR primer to be biotinylated should be the 
pnmer that ,s extended to fonn the single-stranded amplified target nucleic acid of interest. 

The amplified target nucleic acid or set of amplified target nucleic acids ean be co^^ 
attached to a solid support using any ofthe number of methods commonly employed in the art to 

. '™nob.l«eanoUgonucleotideorpolynucleotideonasolids«pport. The an^Kfied target nucleic 
ac.d or set of amplified target nucleic acids covalenUy attached to the solid support should also 
be stable and accessible for base hybridization. 

^^^'^^^^ 

acads to the solid support may occur by reaction between a reactive site et a bincfeg moiety on 
the solid support and a reactive site or another binding moiety attached to the taiget or via 
unervenme linkers or sp»er molecules. Where the two binding moieties can react to fom. a 
covalem bo«l. Coupling of an amplified targe, nucleic acid or set of amplified target nucleic 
acds to a solid support Aay be carried out through a variety of covalent attachment functional 
groups. Any suitable functional group may be used to attach the amplified target nucleic acid or 
set .of amplified target nucleic acids to the solid support, including disulfide, carbamate 
hydrazone. ester. N.funcfionalized thiourea, functionalized maleimide. mercuric-sulfide. gold, 
sulfide, amide, thiolester. azo, ether and amino: 

The solid support may be made from a wide variety of materials, such as cellulose 
nitrocellulose, nylon membranes, controlled-pore glass beads, ac^lamide gels, polystyrene 
actrvaed dextran, agarose, polyethylene, funconaiized plastics, glass, sil.con. aluminum, steel, 
.ron. copper, nickel and gold. Some solid suppon matenals may n^quirc functionalization prior 
to attachment of an oligonucleotide or capture probe. Solid supports that may require such 
surface modification include aluminum, steel, iron, copper, nickel, gold, silicon, and 
nonfimctionalized polymers. Solid support materials for ase in coupling to a capture probe 
.nclude luncbnalized supports such as the i.r-carbonyldiimidazole activated supports available 
fh,m Pierce , Rockford, IL) or function^ized supports such as those commercially available from 
Ghrron Corp. (Emeryville, CA). Binding of an amplified target to a solid support can be carried 
out by reactmg a free amino group of an amino-modified targe, with the reactive imidazole 
carbamate of the solid support. Displacemem of the imidazole gn,up results in formation of a 
stable N-alkyI carbamate linkage between the amplified tai^et and the suppon 

The amplified target nucleic acid or set of amplified target nucleic acids may also be 
bound to a solid support comprising a gold surface. The amplified target nucleic acid or set of 
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amplified target nucleic acids can be modified at their S'-end with a linker arm terminating in a 
thiol group, and the modified amplified target nucleic add or set of modified amplified tai^et 
nucleic acids can be chemisorbed with high affinity onto gold surfecesCHegner. etoL 1993b) 

In methods in which a solid-phase approach is used, preferably the double-stxanded 
amplified target nucleic acid or set of amplified target nucleic acids may be washed to remove 
deleterious contaminants. However, when the amplified target nucleic acid strands of interest are 
ditecUy bound, either covalenUy or via biotin/streptavidin biotin/avidin interactions it is 
pteferable to rigorously wa.h the sample to yield the highest purity. Such a rigorous' wash 
typically removes the complementao' strand, if present. isolaUng the single-stmnded amplified 
target nucleic acid. Following washing, it is necessary to release single-stianded amplified tai^et 
nucleic acids from the solid support for mass spectrometric analysis. TT^e isolation of a set of 
smgle-stranded amplified target nucleic acids may be perfonned on the same plate that is used 
wuhin the mass spectrometer or on a separate surface such as beads or a filter. Both the capt™,. 
probe hybridization and biotin/streptavidin or biotin/avidin approaches can use a number of 
means of denaturatibn to disrupt the noncovalent inteiacuons and afford release of the set of 
smgJe-stranded amplified targetnucleic acids bound to the solid support. 

Alternatively, a cleavable linkage may be inco^^ 
and the amplified targc^t nucleic acids. Any covalem coupling chemistry may be reversible or it 
may be necessary to incltKle a separate chemically cleavable linkage somewhere within the 
bound product. It may also be useful to use a chemically cleavable Hnka.e approach with the 
biotm/stroptavidin (or avidin) strategies so that release of the double-stranded target nucleic acids 
can be performed under relatively mild condiuons. In all cases the cleavable linkage can be 
located within the linker molecule com^ecting the biotin and the base a disulfide bond in the 
Imker). within the base itself (eg. a more labile glycosidic linkage), or within the phosphate 
backbone Imkage {e.g. replacement of phosphate with a phosphoramidate). 

Another way to isolate single-stranded amplified target nucleic acids is to use a primer 
composing an exonuclease blocking moiety and to treat with a 5' to 3' exonuclease which 
digests the strand lacking an exonuclease blocking moiety and the portion of the other strand up 
.0 the exonuclease blocking inoiety. leaving just the portion of the strami containing the 
exonuclease blocking moiety and the ponion of that strand 3' to the exonuclease blocking 
moiety. This method is described in the methods of reducing length section herein Single- 
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str^ amplified target nucleic acids can also be isolated by using a DNA-specific or RNA- 
specifie nuclease to digest an RNA/DNA hybrid. 

The use of two primers each comprising an exonuclease blocking moiety, wherein each 
pnmer bmds to a differem <:omplementa,y stn«d. is another way ,o isolate a double^tianded 
amplified target nucleic acid: 

METHOMOFlUDUCINOl^CtliofAMrUTOD 

• Ate iKe mpUflcMion Vlarget «plified »ddc «:ids which 

.n do„bl«»«W fe«. dan b. cleaved wi* ,csuic«p„ enJo™cl«,„ „ ".^ 

no, wMh A. region if im«e«. „h„ei„ s«d Region ofto^rcs. is «.sp««i „r 
contamirig a polymorphism^ 

'f DNA ,«rt«i„„ ondonuclcasc « ««d ,o ™„ove one or ™,e nanlcing .egions fro„ 
an «nph«ed «,ge, .ukI* add prio, ,„ isotaing U„ .ingfe-suandcd o, doobie-stranded 
".^Mcd ■a.ge, n,^ «dd(i). H - be necesiry for 
.<lonble.aranded row prior lo WisWcaon, or n.o« specifi^^^^ 

^cogmUon sites am. deavinj si«s be locked in double-sira^lcd DNA regions Itoidng or 
ootade U« region of iiaeies, The alcen^ive ,o having M,y do„ble-a™ded DNA pHor «> 
"=»ne„on is .ohybridize restticrton siK oligonoclconde probes ,o single-s^nded DNA «teei„ 
.he resmcion si.e oligo„ucleo.ide p™bes are con,plen»«a.y «, ,he resttiaion sitts forseleoed 

restriction endonucleiases. 

The basic known n,e«KKls for DNA isoUUon - p™=ipiuu„„, dialysis. ffluaUon and 
ch^ma-ography do ho. isbla« single-siranded from doyble-sn^ided DNA. If Uiese p^Ticato, 
me*ods are cnrployed. and i, may be desi,ed » pnxince a single-snamled p,od«t i, 
necessary ro add a separate srqi irfKre single.sl.^ 

rf restricdonendont^ieases are ,«d to cleave offone or more regions f.«n », amplilied 
«rget „ncleic «nd. a^fared mc,H«i for isolating siogle-stntnded «nplified targe, nucleic acids 
fton. these prodt^s is U.^^ tas, one biodnylated pdn», loc«d „ „„c end of an an-plifed 
target ITiucleic acid. 

The production Of tedntiid lengU, ampliHed targc, nucleic acids can provide benefils of 
■nc^ased accuracy'ami resolution in the mass spectrometric a„ate.s of even do„ble.,randed 
-nphlied target nucleic acids. Double-str^Kled amplified target nucleic acids can have their 
length reduced in a simih. manner to that used for processing single-stnu^ted amplified target 
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nucleic acids. Either endogenous restription recognition sites outside the region of interest or 
primer-incorporated restriction recognition sites» as described below, or combinations thereof can 
be used. Endogenous restriction recognition sites are those that are found-naturally within one or 
more flanking regions. 

In cases where one or more endogenous restriction recognition sites cannot be found 
outside the region of interest an alternative method is necessary for reducing the length of the 
ampHfied target nucleic acid. Use of a modified primer during the amplification process can 
mediatfe the incorporation of a Type II or Type IIS restriction endoniiclease recognition site 
within a primer region of the amplified target nucleic acid. Type IIS restriction endonucleases 
recogniK a particular double-stranded sequence region and selecUveiy cleave the double strand a 
defuied distance away from the recognition site.. As an example, the restriction enz>^es Bpmi 
md Bsgt cleave t|ie double su-ands 14 nucleotides (top strand) and 16 nucleotides (bottoni strand) 
avray from the recognition sites. Other representative Type IIS restriction enzymes include 
BseRl, BsmRI aad Fokl. See New England Biolabs 1996 Product Catalog. Use of Type IIS 
restrictioii for the reduction of amplified target nucleic acids is illustrated in FIG. 7. 

The restriction method for reducing the jength of an amplified targei nucleic acid affords 
significfflit advantages where double-stranded amplified target nucleic acids are to be analyzed 
by mass spectrometry. For instance, the smaller molecules are easier to resolve. Moreover, a 
second beneficial effect of using restriction endonucleases to reduce length, specifically one that 
does not produce blunt ends, is the production of two strands of different lengths and hence 
different masses. The creation of two complementary strands of different lengths, e.}^. 4 to 6 
nucleotides differerce in size, yields dramatically improved separation and resolution of two 
complementary strands during mass specirometric analysis as shown in FIG. 4. In many cases. 
reducUon of length by restriction endonuclease digestion can eliminate the need for single-strand 
isolation. 

In one embodiment, the resttiction endonuclease recognition site can be the same as the 
site of cleavage, located in the flanking region opposite from the end of the amplified target 
nucleic acid that is bound to the solid support. In another embodiment, the restriction 
endonuclease recognition site is different from the site of cleavage, as in the case of Type IIS 
restriction endonucleases, which cleave at a defined distance (20-40 bases) from one side of their 
recognition .sequence. When a Type US restriction endonuclease is used to reduce the length of 
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oneo^Mh „,io. flanking a „,i„„ .f ^ Ate^y, Type ,,1^ 

"".ogenons Type „S ^e.i„n «a.S « ..anccCt 

~ u.„« a ...... pHn.e, „ ^ „^ ^ oi. „ „^ ^ 

» -..^.on .cogniUon A ^^^ ^ ^ ^ ^ ^ 

'^'^'^m^^^^^^^,^^ MI«aic.pHn,e.a„:;: 

T'™"- 

- -^«s»..n^.ns«e.ean.^^^^^ 

.«n^be^ .^nee „.,^^^„^,„^,^^ ^_^ To. .a„.p,e. .^.^^ 
conu,.,^iUo,«-,e.nc>e3va«e„e.o,.„exon„c,eascb>„cki„,„K.ieiv ^ 

2"---oc.„. .o.,y. wHe.. .„n.,.a. B.„ek.. ™oie„ p^/. 
exonnCa^e f™„ d„e«in, a ,e^„„ „f ,.„,.^ , ^ ^ 
„ Tl.e exonaclea. b,«king .oie,, can ,„c,«,e nxK^n^ nuCeoUdes «„. p^, 5. i 

pZrr" V""^ "«yrpta«pho««s, .K^ 

Phosha^. and pep,,* nucleic acids CPNA, TO., depict U« if an MocU^ 

n,o«ya„dancxo„ac,eaa..o«»,ce».,eng*ofa,a„e.n,.,cic..d. The nucieo^des ,„ 
^ W U,. connceaa. a. a .„,Up,iei„ „r ^ ^ ^ 

adjacent to one another. . ' 

has hel"""? " ■''^ -8-Pk«Pl-= backbone 

^ Been „pUced . ^ ^ ^^^^^ ^ 

sequence-specific binding to DNA and RNA anrf • . 

. « vyj. A preferred 5 ,o . exonucJease is Exonuclease JII. FIG. 9 illustrates 
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how the use of this method results in isolation of a reduced-length single-stranded amplified 
target nucleic acid. 

The exonuclease approach to reducing length can be also used in combination with one or 
more of the restriction endonuclease cleavage techniques described above to reduce length. In 
such a combined approach, the restriction cleavage should occur before the exonuclease 
digestion. 

One of ordinary skill in the art will appreciate that the above methods of reducing length 
my siiso be used as means of isolating a reduced-length amplified target nucleic acid or a 
reduced-length single-stranded amplified target nucleic acid. For example, a chemically 
cleavable site can be incorporated in a Hanking region and cleavage at that site can Accomplish 
both length reduction and release from a solid support at the same time. Thus, for example, after 
the amplified target nucleic acid is reduced in length, either the boimd strand or the unbound 
strand or both strands of said amplified target nucleic acid can be isolated for mass spectrometric 
analysis. 

fn cises where the single-stranded amplified target nucleic acid strand to be analyzed is 
directly bound to the solid phase, it can be rigorously washed to remove- unbound components, 
including any number of deleterious contaminants, including the linvvanted complementary 
snrand, nucleic acid fragments containing at least a portion of a Hanking region, salts, enzymes, 
and other reagents. These unbound components can be removed from any nucleic acid bound to 
a solid support in any of the embodiments described herein and combinations thereof. If the 
strand lo be analyzed (the strand of interest) is not bound directly but rather via hybridization to a 
complementary nucleic acid, it cannot be as rigorously washed arid thereby cannot be purified to 
as great an extent. Direct binding of the amplified target nucleic acid strand to be analyzed 
ultimately produces a higher quality signal, e.g. less salt adducts. during mass spectroihetric 
ainalysis. thus improving mass resolution and accuracy. 

Following the necessary wash steps, the single-stranded amplified target nucleic acids are 
released from the solid support and analyzed by mass spectrometry. Note that regions that are 
cleaved off by one or more restriction endonucleases are released into solution and washed aWay, 
and are therefore not analyzed. Loss of these flanking regions can enhance the ability for mass 
spectrometry to quickly identify the existence of a polymorphism. The isolation of the single- 
stranded amplified target nucleic acids occurs prior to the mixing of the single-stranded 
amplified target nucleic acids v«th die matrix material for mass spectromeiric analysis. 



15 



10 



W09I8/I235S 

36 PCT/US97/1710I 

a linker n,o,ecu,e co™«i™ fte bk>U„ .„d I t """^ " '""'^ 

• • ~ amplified target nucleic o^a 

^ Mass sp«™^^ ^ ,yp,can, ^.^ ^ 

. ■ ' an associaledmiBMunKv and 

prccBion, However. Uk calibration, and associai«l . "Wcyaod 

~« specron,en, ^s«. (incindin, ^ZZZr7 — ' 

----ese.„.r:::zrrre::::::2°: 

ca,i,»„. ^^^^ _ - -Of 

-n^. bu. Because „ „„. ,„^^ ^ ^ ^ ~ 

^^n^ ^ ^> ime™., caliL J 
disadvantage of using extemaJ calibrants is f h-.. u ants do. Another 

an extra sample to be analyzed bv the 
mass spectrometer. For MALDI-TOF MS ir..n^»ii i -naiyzea by the 

i^ui lut MS. generally only two calibrant molecules are needed for 
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compJete calibration, although sometimes three or more calibrants are used. All of the 
embodiments of the invention described herein can be performed with the use of internal 
calibrants to provide improved mass accuracy. 

Using the methods described herein, one can obtain a mass specmim with numen>us mass 
peaks corresponding to the ^t of single-stranded amplified target nucleic acids under study If 
no polymorphism is present in any of said target nucleic acids, all of ,he mass peaks 
corresponding to the amplified target nucleic acids will be a. mass-to-charge ratios associated 
with the set of amplified target nucleic acids from the wild type target nucleic acids. However if 
a target nucleic acid contains a polymorphism, usually no more than one or two of the nass 
peaks will be shifted in mass, leaving the majority of mass peaks at unahered locations In a 
preferred embodiment of the invention, a self-calibration algorithm uses these nonpolymorphic 
or u^nutated target nucleic acids for internal calibration to optimb.e the mass accuracy for 
analysis ot the singie-stranded a-pplified target nucleic acids containing a pblymonAism thus 
requmng no added calibrant(s). simplifying the calibration, and avoiding potential spectral 
overlaps. In a given sample. hc>wever. it will not be known a priori which mass peaks, if any 
are altered or shifted from their expected masses for the wild type target nucleic acids. 

The self-calibration algorithm begins by dividing up the observed mass peaks into 
subsets, each subset consisting of all but one or of the observed mass peaks. Each data 
subset has a differem one or two mass.peaks deleted from consideration. For each subset the 
algonthm divides the subset further into a firs, group of two or three masses wh.ch arc then used 
to generate a nevv set of calibration constants, and a second group which will serve as an internal 
consistency check on those new constams. The internal consistency check begins by calculating 
the mass difference between the m/z values calculated for the second group of mass peaks and 
the values corresponding to reasonable choices for the associated wild-type target nucieic acids 
The mtemal consistency check can thus take the form of a chi-square minimization where the 
key parame.er is this mass difference. The algorithm finds which data subset has the lowest sum 
Of the squares of these mass differences rcsultmg in a choice of optimized calibration constants 
associated with group one of this data suh.«a:t. 

After new self-optimized calibration constants are obtained, the mas.s-.o-charge ratios are 
detem«ned for the mass peaks omitted from the data subset; these are the amplified target 
nucleic acds suspected to contain a polymorphism. The differences frem the observed mass 
peaks for the wild type amplified target nucleic acids are then used to determine whether a 
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"»ssaccracofappraxtaaely,p„i„,0,„00. P "« should >«M . 

■n» m«hods described he^ p^u. MALDI-TOF MS a,«lysis „f , ^ . 

and »,e .„con>on«,o„ of ™«^™«,i„«, «ud«oddes to e„h«Ke d* „^ ^ff^ 

..d .p,.od ^ „^ »^ineii^r^«r: 

M«.R«o.^.M«sAcc„,.c,..«„r„i^„.M^M<.o.„„ 

Any of , he embodtaenB of rte invention described he«i„ opdonallv i™:lude amoved 
»^nuc.c. .ids one . „o„ „.^^. 

"-.modified ...eotides comprise nuclcoUdes o. „.,eo.ide ^aU^ J" 
"«-Mic«^ d,„ cl^nge Aci, „^ „,a,ive ,0 ^cleoddes d,a, «,ey ™tace L 
n««iif.edn^ddesl„conH>«»di™ote,a^„„e,eicacids„f.he 

F.r c»mple, .he n»ssH,odifled „,«,e„,K,e3 be able .o I. inc«po«W by DN^ or RNA 
pol,««se during an,plirK.,io„ of tf,e u^ge. „«,leic acid M„ or RIVA 

n-.«.des . i.,. .process 1 .o I^*. CZei LnTr 

W»,.sp.ci«ce.eava.eby.es«c.i.endc„^a,^^_*^2^^^^ 
"»d.Hca„o„s can aiso be inco^ra«d in *e .arse, nncieio acids o^U,e nve^irL^ 
en^^uc s,eps have been concl.*d. P^-e^..,,, . ^J^^ 
n»d,fy specific bases, such as Iced,o,cl or fon^ddehyde. «"a»«ac..„ 
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Any or all of the nucleotides in the target nucleic acids can be mass-modified, if 
necessary, to increase the spread between their masses. It has been shown that modification^ at 
the C5 position in pyrimidines or the N7 position in purines do not prevent their incoqwiation 
into growing nucleic acid chains by DNA or RNA polymerase, (Lee et aL, 1992). For example, 
an octynyl moiety can be used in place of methyl pn thymidine to alter the mass by 94 Da. 

Mass-modifying groups can be, for example, halogen, alkyj. ester or polyester, ether or 
polyethcr. or of the general type XR. wherein X is a linking group and R is a mass-modifying 
iriass-mbilifying group can be used to introduce defined mass increments into the 
target nucleic aciis; One of skill in the art will recognize that there are numerous possibilities 
for mass-modlfications useful in modifying nucleic acid fragments or oligonucleotides, including 
those described in OligonucleoUdes and Analogues: A Practical Approach. Eckstein ed. (Oxford 
1991) and in PCT/US94/0dj93. which are both incorporated herein by reference. 

At larger mass ranges (30,000-90.000 Da), the mass resolution and mass accuracy of 
current MALDI-tOF mass spectrometers will not typically be sufficient to identify a single base 
change. For this reason, it may be preferable to increase the useful mass range artificially by 
substituting standard nucleotides within a target nucleic acid with mass-modified nucleotides 
having significantly larger masj differentials. Use of mass-modifi^ nucleotides applies as well 
to the riiass range below 30,000 Da. Mass modification can generally increase the quality of the 
mass spectra by enlarging the mass differences between difFerem amplified target nucleic acids 
of similar size and composition. For example, mass-modificd nucleotides, can increase the 
minimum mass difference between two amplified target nucleic acids that happen to be identical 
in base composition except for a single base which is an A in one and is a T in the other. 
Nomwlly, these two target nucleic acids will differ in mass by only 9 Da. By replacing one of 
the bases with a mass-modified vereion during amplification, the rnass difference can bc > 20 Da 
The ilhrstrations of spectra in FIG. 2A and FIG. 2B depict the influence mass-modified 
nucleotides can have on target nucleic acid resolution. One example of the many possible mass 
modifications useful in this invention is the use of 5-(2-heptynyl)-deoxyuridine in place of 
thymidine. The replacement of a methyl group by heptynyl changes the mass of this particular 
nucleotide by 65 Da. An A to T transversion m a nucleic acid in which all thymidine bases have 
been replaced with 5-<2-heptynyi)^eoxyuridine would produce a peak shift of 56 Da as opposed 
to 9 Da for the same nucleic acid fragments without the mass-modified nucleotide. The use of 
mass-modified nucleotides is especially important in the analysis of single-stranded target 



40 PCT/US97/17101 
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The p,«„„M„«„,i„„ uso includes for preparing „«,eic „cids for n,.ss 

fi^^rraod o. one of ^d „^e acids a, a „^o„ . .o a region of i„.e^ of »ld rarge. 
nucle c acrd; a second prinrer capabl* of lindihg a ^d eon,p,en,c„., ,„ saidTr.. 
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»i.d ^pon capahte Of «.ecti«I, hinding either a p^itive strati „, a „e,a„vc strand 
composm, thertfion of i«ere« of an anrplilied tareet nucletc acids, and a maui, solution 
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cleaved by said Type II restriction endonoclease Tlris kit ™,- ,. 

*" ""y opuonally comprise a solid 

support and a itiatnx solution. 
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The following examples are provided to illustrate embodiments of the invention, but do 
not limit the scope of the invention. 

EXAMPLES 

Example 1. PCR Amplification of a Single Target Nii^leic Acid 

An example PCR protocol that may be employed in this indention is as follows. 
A sample containing 10-10.000 copies of a source DNA may be mixed with two antiparallel 
DNApiimers that surround a target nucleic aad, e:g. the codinig nigion for a gene involved in 
carcinogenesis. The target nucleic acid may be any sequence that ii' known or suspected to be 
pplymorphic, including STRs. SSLP, and genetic deletioiis, insertions, or point polymorphisms. 
The PCR mix may typically be composeid of: 8 ^1 2.5 mM deoxynucleoside iriphosphites. 10 m1 
IPX PCR buffer. 10 ^l 25 mM MgClj; 3 ^l lOiiM fbrwaid primerra Ml I O jiM reverse primer, 
0.3 Hi thermostable Taq DNA polymerase. 64.7 m 1 HjO, and 1 nl source DNA. The sample lube 
may then be sealed and placed into a thermal cycling device- A typical cycling protocol is as 
follows: , . 



Step 1 


95°C2min: 


Step 2 


95X 15 sec. 


Step 3 


15 sec. 


Siep 4 


72X Imin 


Step 5 


repeat Steps 2^ 35 limes 


Step 6 


72X5mm- 


Step? 


stop 



Example 2. ProducHon of Single-Stranded Nucleic Acids by Asymmetric PCR. 

The basic PGR procedure of E.\ample I can be modified in order to produce 
predominantly one of the two strands. These asymnietric procedures involve modifying the 
ratios of the two primers, a typical ratio is 10:1. These procedures are described in Molecular 
Cloning: A Laboratory Manual, Sambrook ei al.. 1989 (incorporated by reference herein). 
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Em-pi. 3. Pr«i«il« f DNA rt. Bl I^Utol PCR Producb. 

''"'^'«P™i™»«'c.p<.rt«g of a™plif,«i large, „,«,,eicac«^ 
of *. nvo prin^ used in PCR ^ir^ „^ ^ 3y„U«si^ed „i,h a bi„U„ n,oie,v ,„«^, 
or dK y e«l ctf fte oligo^deodd^, F„|,<,*i„g a stodaxd PCR, d» douMe-sttmded 
«.be b«^,„a«,„d.ph.. s^^cc^ P„exao^..,(,p„^, 
«™d=<.l'CR;™te,,s™,«,„Hd,5,, of 10mg/.rpa™„ag„edcs«pBvidi«o««lb.«Uto 
^wa^^n.buff„of2.0MNaa ,Op,MTHsCI.^ "».Et^A.pH,..0. Th.«„^i, 
mc»l«Bd.for, 5m.^ " -nixing. FoUowing tabadoa^e lube is 

n^ .o a bigb f,e,d,». eard, nagne, and U» „™n.g„.Ue.be.d, «ift d„ bo»d bio«n,.a,.d 
PCR product are p,.dpi«,ed,o .he ««,, „fd„„b,, Tie supen«d.n. is ,em„«d. ..d d,e 

bufc n,.^s and w^h soiudon are n,i^ a«. d»„ subjec«d once again ■„ d» ^agneUc 
Held u. pn=c.p,u>. .I^ magnetic panicles. n« ,„pen„tt„ ^ ^ ^ ^ 

w«hs«pis«pea«ler.he^M„,^„,^ To rete. «.e „„bi„U„y,a,ed 

s««. fton, d« double-s^nded p^luc. .he beads a.. nux«. wid, an alkaline dena.u«,i„„ 
«h«K,„sucha^O.I MN.OR beads a„= ineuba^da. ™on,».p.,a.„„ f„ ,0 ™i„. which 
denanues *e PCR p,<«luc, a«l ^teases U,. u„bio.in,la.^ pr^luc, i„,„ solution The 
btodnyla^d su™d. bound to the magnetic h«ls is precipi.tcd fro. the solution under d. 
magnetic fteld and unbiotinylated strand. «,„ si^a-stranded, can optionally be transfened to a 
new tube wtd, dK supcnaum, and readied for nm, specnometric analysis. 

The bound single-str^rfed amplified target nucleic acid can be ..leased ftom Ute 
heads using one of a number of diffetcnt procedtues. Tl^e procedures 
.ndude denatutation of biotin/streptavidin bond by heat denatunuion (95 for 5 min.). or dte 
.».of.dena«ran,.sucl,asNaOH(, mM NaOH for 15 mi. at 65?C). and use of a se«»d.„ 
olea^ble site, such as . lisulfidc linkage (I0» mM DTr (dithiodueitol) fur „ min. a, room 
"mpcrature, or a 5' thiolated nucleotide ,0.1 mM AgNO, for ,5 ndn. at room temperature) 
present within the primer. 

Example 4. Mass Modification of Target Nucleic Acids. 

Mass modlHcation of the target nucleic acid is typically perfonr,ed during the 
amplification step. One or more standard deoxynucleoside triphosphates arc replaced with 
modified deoxynucleoside triphosphates. As an example thymidine may be .placed with a 5- 
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aIkynyl-subsiituted.2'-deoxyuridine triphosphate. Because the modified nucleotides may not be 
efficient substrates for DNA polymerase it may be necessary to inciease the concentration of the 
corresponding triphosphate by a factor of 2 to 1 00 over nonnal levek. 

Example 5. Analysis of Single-Stranded AmpUfied Telranucieotide Repeat Region of 
THOIGene. 

A sample of human genomic DNA is subjected to PCR amplification with the primer 
pair: (SEQ ID NO: 2) 5^Biotin^TGATTCCCATT6GCCTGT(sT)CCTC.3' and (SEQ ID NO:2) 
S^ AGfGCAGGTCACAGOGAACACAGAO'. which selectively amplify the tetranucleotide 
repeat itgion of the tyrosihe hydroxylase riHOI") gene to give 90-114 bp PCR products where 
sT is S'-thiolaied thymidine, also known ak 2'-deoxythymidine-5'-(S)-phosphoroihioatc (see FIG. 
16). These PGR amplified target nucleic acids are about a factor of 2 smaller products than 
current commercially available primers provide. The PCR reaction is perfomied on a 50 
scale using 17.5 pmol of each primer and 50-100 ng of template, with 30 3-stcp themal cycles, 
following an initial step at 94»C for 2 mih, of 94°C for 45 s followed by SS-C for 30 s and then 
72'C for 30 s. A final step at 72»C for 5 min is added to complete the reaction. The biotinylated 
product is bound to streptavidin-coated magnetic beads MPG (CPG Inc., Great Nfeck. NY) and 
then subjected to denaturation conditions of 0. 1 M NaOH for 10 minutes. The solid-support- 
bound single-stranded target nucleic acid is then subjected to extensive washing with 10 mM 
ammonium acetate and deionized water. The single-stranded amplified target nucleic acid is 
then released from the beads by cleavage of tiic P-S bond in the cleavable prim** with 0.1 mM 
AgN03. After 2 nL of 100 mM dithiothreitol is added to sequester Uie Ag+ ion. the sample is 
evaporated to dtyricss in a Speed-vac concenu^tor. For analysis, the sample is redissolyed in 
I HL deionized HjO and is mixed with 1 mL of matrix solution consisting of 3-HPA 
(3.hydroxypicolinic acid) in acetonitrile:HjO 1: i . The sample is deposited onto a silicon stage, 
dried uiider a gentle flow of nitrogen and is placed into the mass spectrometer. 

The experimental apparatus used for analyzing the sample amplified target nucleic acids 
is corripdsed of an excitation source, a sample manipulator and a TOP mass spectrometer. Tlie 
excitation source used for desorption is a Nd:YAG laser. The laser is operated at a 10 Hz 
repetition rate, with a 5 nanosecond pulse width. The desorption laser beam maintained at an 
incident angle of 450 is focused onto the sample with a 250 mm focal Icngti, AL-2 coated 
spherical mirror to an elliptical spot size of approximately 100 by 150 nm. A Clan-laser 
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|X)Jarizer (Ne^vport Corporation, Fountain Valley. CA) is plac^ in a rotation stage in the beam 
path for continuously variable attenuation, allowing adjustment of the polarized Nd: YAG laser 
energy density fn,m below 1 nU/cm^ ,o 1 00 m W. The opUmum energy density for desorpUon 
is in the range of 10 to 20 mJ/cm^ 

Mass spectra are recorded in positive-ion mode at room temperature. The sample region 
is evacuated by a 300 liter per second turbomolecular pump. The drift and detection regions are 
evacuated using a cryopump with nominal 1500 liter per second pumping speed. Tl,e base 
pressure-of the chamber is 3 x lO"' Torr. and the normal working pi«sure, within about five 
minutes of sample introduction, is 5 x 1 0"* Tort. A total of 100 laser shots are summed to obtain 
a spectrum. TTie spectrum shown in FIG. 13 reveals two clear peaks corresponding to an 84.mer 
and a 9|.mer. which are the expected product sizes coirespoiiding to 8 and 9.3 repeats with one 
extra adenine base added to the 3'-end of each, due tp the well-known property of Taq DNA 
polymerase to yield one-base over^extensions. The t«^o peaks arise because the final sample, 
though iingle-stranded, derives from amplification ofa heterozygous allele. 

Example 6. Comparison of mass spectra for a single-stranded amplffied THOI ladder of 
nuclde adds and a single^tranded amplified THOI ladder that bave had their lengths 
reddced endonucleaise cleavage. 

A 1 ML sample of THOI ladder (Promega Inc.. Madison, WI) which contains PGR 
products ranging in size from 179-203 bp is reampliHcd in a 100 mL reaction volume according 
to the same amplification protocol as described above for genomic DNA to yield a ladder of 
products in the size range of 90-1 14 bp. One half of the product mixture (amplified tatBct 
nucleic acids) is bound to strepiavidin-coated magnetic beads, denatured, washed and cleaved 
from the beads as described in the previous example. The other half of the product mixture is 
then bound to the streptavidin-coated magnetic beads and washed. Then, 2 pL of 1 OX NEB 
Buffer 4 is added to the second half of the amplified target nucleic acids, followed by the 
addition of 16 ixL of H,0 and 20 units of Nco 1 restriction endonuclease (New England Biolabs. 
Beveriy, MA) recognizing CCATGG. This restriction endonuclease was chosen to reduce the 
length of the amplified target nucleic acid because the amplified target nucleic acid contained a 
Nco I recognition site (in the wild type or consensus sequence) which was located in a flanking 
region and not in the region of interest. The mixmre is incubated at 37"C for I hour after which 
the enzyme is washed away, the amplified target nucleic acids are bound through bioiinylated 
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priifters, denatured, washed aiid single-stranded ampliHed target nucleic acids are then cleaved 
from the beads with AgNOy The restriction enzyme-digested ladder has a size range of 40 to 64 
bases as a result of the enzyme cutting 3 bp past the end of the final CATT repeat. Samples of 
both undigesied/full-length and digested/reduced-length products are prepared for mass 
spectrometry analysis by reducing the volume by evaporation and adding I ^L of 3- 
hydroxypicolinic acid matrix solution and allowing to dry on the sample plate. The resulting 
positive-ion mass specua of the undigested arid digested laddens are shown in FIG. 14 and 
FIG. tS. respectively. The mass resolution of the peaks for the digested ladder (FIG. 15) is much 
greater than that for the undigested ladder (FIG. 14). This is due ttf the use of Nco I to reduce the 
length of the single-stranded amplified target huileic abids that were subjected to mass 
spectrometric analysis. 

. Example 7. Mws Spectrometry Analysis. 

The single-stranded amplified target nucleic acid sample to be analyzed is typically 
mixed with an equal volume of matrix solution consisting of 0.5 M 3-hydroxypicolinic acid (3- 
HPA) and 50 mM diammonium hydrogen citrate. Typically, a 1 portion of the sample is 
applied to the mass speedometer sample stage and allowed to dry under a gentle stream of 
nitrogen gas at room temperature. When the sample has completely dried to foim crystals 
(typically 5 min,) the sample is inserted into the mass spectrometer for analysis The usual 
analysis conditions employ the use of a Nd:YAG laser operating at 266 nm with an average pulse 
energy of 15 mJ/cm'. An average of 100 laser shots is typically used to obtain a spectrum. 

All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or patent 
application was specifically and individually indicated to be incorporated by reference. 

The invention now being fully described, it will be apparent to one of ordinary skill in the 
art that many changes and modifications can be made thereto without departing from the spirit or 
scope of the uivention and the appended claims. 
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SEQUENCE USTWa 



<1) GENERAL INFORMATION: 



(i> APPLICANT: 

(A) NAME: GBNETRACB SYSTEMS, INC. 

(B) STREET: 333 Ravenswood Avenue, FN 083 

(C) CITY: Menlo Park 

(D) STATE: OA . • I 

(E) COUNTRY: US 

JF) POSTAL CODE (ZIP) : 94025 
... ^ (G) TELEPHONE: (512) 418-3000 
(H) TELEFAX: (713) 789-2675 

iii) TITLE OF INVENTION: METHODS OP PREPARING NUCLEIC ACIDS FOR MASS 
SPECTROMETRIC ANALYSIS 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy dislc 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/759,993 

(B) FILING DATE: 02-DEC-1996 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/032,369 

(B) FILING DATE: 02 -DEC- 1996 

(vi) PRIOR APPLICATION DATA; 

(A) APPLICATION NUMBER: US 08/715.582 

(B) FILING DATE: 19 -SEP- 1996 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : Single 
ID) TOPOLOGY: linear 

Cix) FEATURE: 

(A) NAME/KEY: misc_f eature 
.(B) LOCATION: 1 ^ 

(D) OTHER INFORMATION :/note= «5' end is biotinylated" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied^base 

(B) LOCATION: 20 

(D) OTHER INFORMATION :/note= - 5 • -thio- thymidine" 
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(2) INFORMATION FOR SEQ ID NO: 2: " ^ 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

IXiT SEQUENCE DESCRIPTION: SEQ ID W: 2 : 
AGTGCAGGTG ACAGGQAACA CAGA 

24 
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- (Xi) SEQUENCE DESCRIPTION: SEQ ID NOs 1: 
GTGATTCCCA TTGGCCTGTT CCTC 
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1 . A method of detcrniining the mass of a target nucleic acid by mass spectrometric analysis 
comprising: 

5 a) identifying a target nucleic acid, wherein the taiget nucleic acid comprises a 

region of interest and one or more flanking legions; 

b) reducing the length of the target nucleic acid by cleaving at least a portion of one 
or more of said flanking regions to produce a reduced-length target nucleic acid; 

c) obtaining a single-stranded reduced-length target nucleic acid; and 
10 d) deteminingthemassofthesingle-stiOT 

using a mass specux>meter. 



2. The method of claim 1 . fiirther comprising amplifying the target nucleic acid prior to 
15 reducing the length of the target iiucleic acid to produce an amplified target nucleicacid. 



3. The method of claim 1, wherein the reducing step comprises using a restriction 
endonuclease capable of cleaving at a cleavable site. 

20 



4. The method of claim 3, wherem the restriction endonuclease is a Type IIS restriction 
endonuclease. 



25 



5. The method of claim 3. wherein the restriction endonuclease is a Type II restriction 
endonuclease. 



30 6. The method ofdaim 3, wherein the amplifying step comprises using a cleavable primer 
comprising a recognition site for the restriction endonuclease. 
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7. The method of claim 2. wherein the amplifying step comprises using a cleavable primer 
comprising a cleavable site. 



8. Tbemethod of claim 7, wherein the reducing step comprises treating theamplified target 
nucleicacidswitha5*to3'exonuclease. : , ; ;> 



iO 9. 'n^n.etfaodofclaimT.wheidnthered^m^^ 
cleavable site. 



10. T>«'nethpdofclaim 9. wherein the chemically cleavabfe«^^^^ 
15 a modified sugar, or a chemically cleavable gmup inconx,«ted into the phosphate backbone: 



1 1. Tlie method of claim 1 0. wherein the chemically cleavable site comprises a chemically 
cleavable group. 



20 



12. The method of claim 1 1. wherein the chemically cleavable ^roup comprises 
dialkoxysilane. 3'-(S)-phosphorothioatc, 5-(S)-phosphorothioate. 3'.(N)-phosphoroamidate. or 



S'-(N)-phosphoroamidate 

25 



30 



13. Themethodofclaiml2,wheteinthechemicanycleavablegr6upcomprises3'-(S)- 
phosphorothioate or 5'-(S)-phosphonrthioate. 



14. method ofclaim 10, wherein the chemically deavable site comprises a modified 

sugar. 
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15. The method of claim 14, whensin the modified sugar comprises ribose. _ 



16. The method of claim 2, wherein the amplified target nucleic acid lunher comprises a fint 
strand and a second complementary strand and the obtaining step comprises: ■. ■■ 



10 b) 



attaching the first strand of the amplified target nucleic acid to a solid support; 
and 

separating the first strand from the second strand to produce a bound first strand 
and in unbound second strand, wherein the mass of the unbound second strand is 
deterinined using a mass spectrometer. 



15 17. TTie method of claim 16. wherein the reducingstepcomprises.using a restriction 
endonucieasb cajjoble of cleaving at a cleavable site. 



1 8. The method of claim 1 7, wherein the restriction endonuclease is a Type IIS restriction 
20 endonuclease. 



19. The method of claim 1 7. wherein the restriction endonuclease is a Type 11 restriction 
endonuclease. 

25 



20. The method of claim 1 7, wherein the amplifying step further comprises using a cleavable 
primer comprising a recognition site for the restriction endonuclease. 



30 



21. 'n»emethodofclaiml6,whereintheamplilyingstepfurth«comprisesusingacl^^^^ 
primer comprising a cleavable site. 
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target nucleic acids with a 5' to 3' exonuclease. ««nP'Mieu 



20 L 

20 dialkoxysilane, 3'-(S).phosphorothioate 5- ^«5^»h k u- 



25 !1 ,™'r'°'°'''""''''*°^'*""'^"'*»"P""PH«s3MS> 
25 '"""PhorataoaieorS'^j-phosphoTOMoate. ' 



28. ^-'■^»f'*>to27.w!^„U„a««^i,„^^^- 
«W».coa.in5»i<li„„rslrep.avidm. »*=«u, .he «,M 
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29.- The method of claim 28 wherein the amplified target nucleic acid comprises a mass- 
modified nucleotide. 



30. The method of claim 25. wherein the chemically cleavable site comprises a modified 
sugar. 



31. The method of daun 30, wherein the modified sugar comprises ribose. 



32. The method of claim 1 6. wherein the first strand is biotinylaied and, wherein the solid 
support contains avidin or streptavidin. 



33. The method of claim 16. wherein the amplified target nucleic acid comprises a mass- 
modified nucleotide. 



34. The method of claim 16, wherein said detennining step further comprises utilizing 
internal self-caiibrants. 



35. The method of claim 2, wherein the amplified target nucleic acid comprises a first strand 
and a second complementary strand and the obtaining step comprises: 

a) attaching the first strand of the amplified target nucleic acid to a solid support: 

b) separating the first strand from the second strand to produce a bound first strand 
and an unbound second strand: 

c) removing the unbound second strand; and 

d) releasing the bound first strand from die solid support to produce a single- 
stranded reduced-length amplified target nucleic acid for mass specttal analysis. 
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36. nen^otcMm 35. ^ fte «I«i„g s^, comprises Mi.* . cl«v.bl. Ifakcr. 



5 37. ^--"XoiofC^iS.^^.^^,^^^^^^ 
eKtooiKleaa MiMble of deaviiie at a deavable site. 



.0 ILonJl"^'''*^''-'*^'^^— 



3». Tie metlKrf „,ctai,„ 37. ,«,ric,io„ «rf„„^^ 13 , „ 

endonuclease. 

15 



40. ^«-eU,odofcIaim37.wherein,hean,pHfyi„gstepfurthercompr^^^ 
pnmer comprising a recognition site for the restriction endonuclease. 



20 



41. The method of claim 35. wherein the amplifying step further comprises using a cleavable 
pnmer comprising a cleavable site. 



25 42. •"'enH.thodcfcIaim 41, wbe«iri tt,e reducing step comprises^U^^^^^ 
target nucleic acids with a 5' to 3* exonuclease. 



30 



43. The method of claim 41. whe«.in the cleavable site comprises a chemically cleavable site. 
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44. The method of claim 43. wherem the chemically cleavabic site comprises a modified 
base, a modified sugar, or a chonically cleavable group incorponited imo the phosphate 
backbone. 



41 The method of claim 44. wherein the chemically cleayable site comprises a chemically 
cleavable group. 



10 46. Thcmethodofclaim45,whefeinthcchemicallycleayablegroupconiprises 

dialkoxysilanc. 3'-(S)-phospha«>thioa.e. 5'-CS)-phosphoiothioate. 3 Wphosphoroamidate, or 



5'-(N)-phosphoroamidale. 



15 47. Themethodofclaim46.whereinthcchemicallycleavablegroupcomprises3'^^^^ 
phosphorothioate or 5'.(S)-phosphbrothioatc. 



48. The method of claim 47, wherein the firs, strand is biotinyiated and. wherein the solid 
20 support contains avidin or streptavidin. 



49. The method ofclaim 48. wherein the amplified target nuclcic^acid comprises a mass- 
modified nucleotide. 



25 



50. The method of claim 44. wherem the chemically cleavable site comprises a modified 
sugar. 



30 



method of claim 50. wherein the modified sugar comprises ribose. 
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52. -^'-^or^is.^^^^g^^^^^^,^^^^^ 

support contains avidin or streptavidin. 



modified nucleotide. ■ 



'0 54. ^.'^oU^ls.-^s^^,^^^^^^ 

mlenialself«lilraiits. *^ uunzmg 



55. Am«hcd„,p^,.^^,^^^_^^_^^._^^^^___^ 
15 analysis comprising: - 

a) amplifying a target nucleic acid to produce an amplified targe, nucleic acid 
wherein the amplified target nucleic acid comprises a first strand and a second 
complementary strand; 

b) .. «^-g . he first strand of the aniplified target nucleic aci^^ 
produce a bound first strand and an unbound second strand; " 

c) removing the unbound second strand from the bound fir^st strand- 

d) -"easing the bound first strand to the solid support to f^^ 
amplified target nucleic acid; arid 

e) d-^termining the mass of the single-stnuuiedampUfied target nuck^^^ 



20 



25 



mass spectrometer. 



30 



56. Th6methodofclaim55.whei.i„,hereieasingstepcomprisesu^ngacl^^^^^^ 



57. 7T«„.ethodofclaim56.w^^ 
amplified target nucleic acid. 
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58. the method of claim 55, wherein the releasing step comprises using a restriction 
endonuclease C2q)able of cleaving at a cleavable site. 



59. the method of claim 55, wherein the restriction endonuclease is a type HS restriction 
endonuclease. 



60. the method of claim 59. yvherein the restriction endpnuclease is a type II restriction 
endonuclease. 



61. the method of claim 58, wherein the ampUfying step further comprises usihga cleavable 
primer comiJrisii^ a recognition site for the restnctioaendonuclease. 



62. the method of claim 55, wherein the .amplifying step further comprises using a cleavable 
primer comprising a cleav£d)le site. 



63. the method of claim 62, wherein the releasing step wmprises treating the ainplified 
target nucleic acids with a 5" to 3" exonuclease. 



64. the method of claim 62, wherein the releasing step comprises cleaving at a chemically 
cleavable site. 
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65. - TT«™ethodofcIaim64,wherdnttechenucallycIeavables 

base, a modified sugar, or a chemically cleavable g^up incorpomcd into the phosphate 

backbone. ^ 



66. method of claim 65, wherein the chemically cleavable site comprises a chemically 

cleavable group. 



10 67. ™«™>l»dofolaim66,wheretathedKn«c^lycleay.blegnwc<)mpri« 

5*-(N)-phosphoroainidate. 



15 68. ^ T>enH«hpdpfclaim6^vteih.hed,^^ 
phosphorothioate or 5'-(S).phosphorothioate, 



69. The method of cI«m69,whe™n the fim strand is biotinylated and. ^ 
20 support contains avidin or streptavidin. 



70. Tte^iv^thod of claim69. wherein the amplified^u^^ 
modified nucleotide. 



25 



71. n,e method of claim 65. wherein the chemically cleavable site comprises a modified 
sugar. 



30 

72. The method of claim 71 , wherein the modified 



sugar comprises ribose. 
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73. The method of claim 55, whorein the first strand is biotinylaled and, wherein the solid 
support contains avidin or streptavidin. 



74. The method of claim 55, wherein the amplified target nucleic acid comprises a mass- 
modified nucleotide. 



75. The method of claim 55, wherein said determining step further comprises utilizing 
internal self-caiibrants. 



76. A kit for preparing a double-stranded target nucleic acid having a first strand and a 
second complementary strand for mass spectrometric analysis comprising: 

a) a first primer capable of binding the first strand of the target nucleic add 5' to a 
region of interest of the target nucleic acid; 

b) a second primer capable of binding the second strand of the target nucleic acid 5' 
to the region of interest of the target nucleic acid; 

c) a DNA polymerase; and 

d) a restriction endonuclease . 

77. The kit of claim 76, wherein the restriction endonuclease is a Type II restriction 
endonuclease. 



78. The kit of claim 76, wherein the restriction endonuclease is a Type IIS restriction 
endonuclease. 



79. The kit of claim 76, wherein the first primer is biotinylated. 
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80. The kit of claim 76. further comprising a solid support capable of selectively binding the 
first strand of the target nucleic acid. 

5 



81. The kit of claim 76, further comprising a matrix. 



10 82. The kit of claim 81, wherein the matrix comprises 3.hydroxypicollnic acid. 



83. A kit for preparing a double-sttanded target nucleic acid having a first strand and a 
second cdinpleinentary strand for mass spaitibmetric analysis comprising: 
15 a) afirstprimercapableofbindingthcfirststrandofthetargetnucleic^a^^^ 

region of interest of the ^t nucleic add. whfeicin said finrt prima comprises a cleavable 
primer; 

b) a second primer capable of binding the second strand 5' to the region of interest of 
the target nucleic acid; and 



20 



c) ■ a DNA polymerase. 



84. The kit ofclaim 83. wherein the cleavable primer comprises a chemically cleavable sit^^ 

25 

85. The kit of claim 84, wherein the chemically cleavable site comprises a modified base, a 
niodifiedsugar.orachemicallycleavablegroupincoiporatedintothephosphatebackbone. 



30 86. The method of claim 85, wherem the chemically cleavable site comprises a chemically 
cleavable group. 
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10 



87. The kit of claim 86. wherein the chemically cleavable group comprises dialkoxysilane. 3'. 
(S).phosphorothioale. 5HS).phosphorothioate. 3'-(N)-phosphoroamidate. or 5'^. 
phosphoroamidate. 



88. The kit of claim 87. wherein (he chemically cleavable group comprises 3'-(S)- 
phpsphorothioaie or 5'-{S)-phosphprothioate. 



89. The kit of claim 83 . wherein ihe cleavable primer comprises an exonuclease blocking 
moiety. 



15 90. The kit of claim 83, whet^in thecleavableprimer comprises a Type IIS restriction 
endonuclease recognition site. 



91. The kit of claim 83 further comprising a solid support capable of selectively binding the 
20 first strand ofihe target nucleic acid. 



92. The kit of claim 91, wherein the first strand is biotinylated. 

25 

93. The kh of claim 83, further comprising a matrix. 



94. n;e kit of claim 93. wherein the matrix comprises 3-hydroxypicoIinic acid. 
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